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EXPRESSION MODULATING SEQUENCES 



FIELD OF THE INVENTION 



5 The present invention relates generally to a method for modulating expression of a genetic 
sequence and to agents useful for same. More particularly, the present invention provides a 
means for modulating expression of a genetic sequence by introducing, creating or deleting 
one or more pseudo-translation initiation sites in the nucleotide sequence of an mRNA, 
upstream, i.e. 5', of the authentic translation initiation site of an open reading frame. The 

10 present invention provides further modulation of expression by introducing, creating or 
removing Kozac or Kozac-like sequences genetically proximal to the pseudo-translation 
initiation site(s). Modulation of expression is further manipulated by the introduction, 
creation or removal of a termination signal prior to the authentic translation initiation site 
or after this site but in a different reading frame relative to the reading frame determined 

1 5 by the authentic translation initiation site. The present invention further provides genetic 
agents including a plurality of nucleic acid molecules each with a predetermined number of 
pseudo-translation initiation sites and/or pseudo-open reading frames (ORFs) wherein each 
sequence influences or otherwise contributes to a particular level of expression for genetic 
sequences operably linked or associated to the 3' end of said nucleic acid molecules. The 

20 level of expression of the genetic sequences is commensurate with a selected nucleic acid 
molecule which becomes a 5' untranslated or leader region (5'UTR) of said genetic 
sequence. The present invention still further contemplates a method for detecting a disease 
condition such as cancer or a proliferative disorder wherein the disease condition is 
associated with a particular level of expression of a gene or other genetic sequence. Such a 

25 method is predicated in part on identifying a particular 5'UTR or 5'UTR-encoding 
sequence or the level of pseudo-translation initiation sits therein alone or in combination 
with pseudo-ORFs which provides an indication as to the likely level of expression of said 
gene or genetic sequences. The ability to modulate the level of expression of a genetic 
sequence is useful, inter alia, for gene therapy applications and for expressing traits at 

30 selective levels in plants. Such traits include herbicide and pesticide resistance. 
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BACKGROUND OF THE INVENTION 

Bibliographic details of the publications numerically referred to in this specification are 
collected at the end of the description. 

5 

Reference to any prior art in this specification is not, and should not be taken as, an 
acknowledgment or any form of suggestion that this prior art forms part of the common 
general knowledge in Australia or any other country. 

10 For most eukaryotic mRNAs, translation initiates with the binding of the cap binding 
protein to the mRNA cap structure. This is then followed by the binding of several other 
translation factors, as well as the 43S ribosome pre-initiation complex. This complex 
travels down the 5' region of the mRNA molecule while scanning for an initiation codon in 
an appropriate sequence context. Once the initiation codon has been found and with the 

15 addition of the 60S ribosomal subunit, the complete 80S initiation complex initiates protein 
translation (1,2,3). A second class of mRNAs have been identified which possess 
translation initiation features different from those described above. Translation from these 
mRNAs initiates in a cap-independent manner and is believed to initiate with the ribosome 
binding to internal portions of the leader sequence (4,5,6,7). Most 5' untranslated leader 

20 sequences are very A,U rich and are predicted to lack any significant secondary structure. 
One of the early steps in translation initiation is the relaxing or unwinding of the secondary 
mRNA structure (6). Messenger RNA leader sequences with negligible secondary structure 
may not require this additional unwinding step and may, therefore, be more accessible to 
the translation initiation components. The ability of a leader sequence to interact with 

25 translational components may play a key role in affecting levels of subsequent gene 
expression. 

In work leading up to the present invention, the inventors sought to identify 5' leader 
sequences which might be responsible for the regulation of gene expression. The gene 
30 initially chosen for investigation was the GUI gene. 
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The gene GUI was originally isolated as a highly amplified gene in a malignant glioma (8) 
and subsequently implicated in the development of other tumor types, including 
liposarcoma, rhabdomyosarcoma, osteosarcoma and astrocytoma (9,10). It has been shown 
5 that GUI encodes a transcription factor which is a downstream nuclear component of the 
Sonic Hedgehog/Patched (SHH/PTC) signalling pathway (11,12,13). This pathway is 
evolutionarily conserved and found to operate in a number of tissues during vertebrate 
development and especially in regions involving mesoderm-ectoderm interactions 
(14,15,16,17). Intercellular signalling by this pathway is initiated when SHH (a secreted 
10 protein) binds to PTC (a cell-surface transmembrane protein), resulting in the activation of 
GLI1 in the nucleus and subsequent expression of target genes. Over-expression of SHH 
has been shown to upregulate GUI in the chick limb buds and in the epidermal ectoderm 
of frog embryos (18,19) whereas, GUI expression is undetectable in SHH null embryos 
(20,21), confirming that SHH signalling regulates GUI expression. 

15 

The discovery of PTC mutations in familial and sporadic forms of basal cell carcinoma 
(BCC), the most common skin cancer, has associated aberrant signalling of the SHH/PTC 
pathway with the formation of these tumors (22,23,24). The genetic data are supported by 
experimental evidence showing that over-expression of SHH and other components of this 

20 pathway results in the induction of BCCs in transgenic mice and transgenic human skin 
(25,26,27). Over-expression of GUI produces BCC-like lesions in transgenic tadpoles (28) 
and transforms rodent epithelial cells in cooperation with adenovirus EI A (29) indicating 
that unregulated expression of GUI is oncogenic. Studies have shown that GUI 
expression is greatly increased in BCCs but not in the surrounding normal tissue consistent 

25 with a central role in rumor formation (28,30,3 1 ). 

In addition to GLI1, two other isoforms have been identified in vertebrates (termed 
•GLI2* and *GLI3*), each encoded by a separate gene (15,17). The GLI genes are highly 
expressed during development and their expression profiles correlate with organogenesis 
30 but show only low level expression in most adult tissues (14,16,17). In the skin, GUI 
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expression is readily observed in the epidermal compartment of the developing hair follicle 
whereas GLI2 and GLI3 transcripts were detected in the surrounding mesenchyme 
(17,28,31). The role of each GLI in mediating the SHH signal is not yet clear but recent 
gene ablation studies on GLI2 and GLI3 have shown overlapping roles and indicated some 
5 functional redundancy (32). A number of studies have indicated that GLI J encodes a 
transcriptional activator, whereas GLI2 and GL13 encode factors which can act as both an 
activator or a repressor depending on specific post-translational modifications (33,34,35). 
Interestingly, GLI2 and GL13 are now thought to regulate GLI1 transcription directly by 
binding to the GLI I promoter (33,35). 

10 

In accordance with the present invention, the subject inventors have now identified 
alternative 5'UTRs of GLI1 transcripts in mammalian tissues which are generated by exon 
skipping and which confer marked differences in translation efficiency. The inventors • 
results indicate that post-transcriptional regulation of GLll is mediated by the 5'UTR 
15 generated through exon skipping and show an association of the most efficiently translated 
5'UTR transcript with BCC and cellular proliferation. 

More particularly, the inventors surprisingly determined that by altering the number of 
sequence elements corresponding to pseudo-translation initiation sites, i.e. RUG or RTG 

20 triplets (where R is A or G), within the leader sequence of a nucleic acid molecule and 
prior to the authentic translation initiation site alone or in combination with termination 
signals in the 5' UTR or in a different reading frame within the coding sequence and/or in 
the 3' UTR, it is possible to modulate gene expression. In a further determination, the 
present inventors identified that altering nucleotide sequences proximal to the RUG or 

25 RTG triplets within a leader sequence also permitted the regulation of gene expression. 
Thus, the subject inventors have developed a method for the regulation of gene expression 
in eukaryotic cells including animal cells and plant cells. Furthermore, the identification of 
expression modulating sequences enables determination to be made as to expected levels 
of gene expression such as in certain disease conditions or to express traits at selective 

30 levels in animal and plant cells. 
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SUMMARY OF THE INVENTION 

Throughout this specification, unless the context requires otherwise, the word "comprise", 
or variations such as "comprises" or "comprising", will be understood to imply the 
5 inclusion of a stated element or integer or group of elements or integers but not the 
exclusion of any other element or integer or group of elements or integers. 

Nucleotide and amino acid sequences are referred to by a sequence identifier number (SEQ 
ID NO:). The SEQ ID NOs: correspond numerically to the sequence identifiers <400>1, 
10 <400>2, etc. A sequence listing is provided after the claims. 

The present invention is predicated in part on the proposition that the level of expression of 
a genetic sequence is proportional or otherwise dependent or influenced by the number of 
pseudo-translation sites in the 5' leader sequence upstream of the authentic translation 

1 5 initiation site alone or in combination with pseudo-open reading frames (pseudo-ORFs) in 
or beginning in the 5' leader sequence. A pseudo-ORF is also known as an upstream ORF 
(uORF). The authentic translation initiation site is regarded as the RTG or RUG triplet, 
where R is A or G, at the beginning of an open reading frame or coding sequence. A 
pseudo-translation initiation site is regarded as an RTG or RUG located in the 5' nucleotide 

20 sequence upstream of the authentic translation initiation site. The term "pseudo" is used to 
distinguish these sites or triplets from the authentic RTG or RUG triplet. In accordance 
with the present invention, it is proposed that elevated expression of a genetic sequence, 
which comprises an open reading frame and an authentic translation initiation site and a 
leader sequence upstream of said authentic translation initiation site, occurs when 

25 RTG/RUG triplets are removed or destroyed in the upstream sequence. Conversely, 
expression is reduced by introducing or creating pseudo-translation initiation sites. 
Furthermore, pseudo-ORFs may be present or created or removed by the introduction of 
one or more termination signals within the 5' leader sequence or in a different reading 
frame within the coding sequence or within the 3'UTR. 

30 
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One of the many implications of the present invention is the ability to design genetic 
elements which permit the controlled expression of a genetic sequence, to which they are 
operably connected or associated, to expected levels in animal, mammalian (including 
human) and plant cells. For example, a plurality of genetic elements are produced each 
5 with a different number of pseudo-translation initiation triplets, i.e. RTG or RUG where R 
is A or G. Where high expression of a genetic sequence is required, a genetic element is 
selected with no or few pseudo-translation initiation sites and this is fused to the 5' end of 
the genetic sequence or inserted into a 5' leader sequence associated with said genetic 
sequence. When the resulting construct is operably linked to a promoter, higher expression 
10 of the genetic sequence is expected relative to a genetic element comprising a greater 
number of pseudo-translation initiation sites. Modulation of expression further comprises 
the introduction or removal of termination signals to create pseudo-ORFs in or beginning 
in the 5* leader sequence and terminating in the 5' leader sequence, within the main ORF 
(authentic ORF) but in a different reading frame and/or in a 3'UTR. 

15 

A plurality of genetic constructs may also be generated each comprising a genetic element 
having a predetermined number of pseudo-translation initiation sites. When expression of a 
genetic sequence is required, the genetic sequence is ligated into the construct downstream 
of the genetic element. 

20 

The identification of genetic markers of expression, i.e. the number of pseudo-translation 
initiation sites and optionally pseudo-ORFs, further enables the determination of the likely 
level of expression of a gene under investigation such as a gene involved in disease 
including cancer or a proliferative disorder. Furthermore, various traits may be expressed 
25 at selective levels in animal and plant cells 

The present invention provides, therefore, a method for diagnosing or predicting the likely 
level of expression of a target gene based on the number of pseudo -translation initiation 
triplets in its 5' sequence, upstream of the authentic translation initiation site and/or the 
30 number of pseudo-ORFs in the target gene in animal and plant cells. 
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Accordingly, the present invention contemplates for modulating the expression of a genetic 
sequence wherein said sequence comprises an ORF having an RTG or RUG wherein R is 
A or G corresponding to an authentic translation site of said ORF and a nucleotide 
5 sequence 5' of said authentic translation start site, said method comprising introducing or 
removing one or more RTG or RUG triplets in said 5' nucleotide sequence upstream of 
said authentic translation initiation site such that upon expression of said genetic sequence, 
there is a respective decrease or increase in the level of expression. 

10 The modulation of expression of a subject genetic sequence may also involve manipulating 
nucleotide sequences proximal to a pseudo-translation initiation site to introduce, create or 
remove Kozac or Kozac-like sequences and/or introducing or removing termination signals 
to create or destroy pseudo-ORFs in the nucleotide sequence. The presence of such a 
sequence is proposed to enhance the function of a pseudo-translation initiation site thereby 

1 5 further decreasing expression of a downstream genetic sequence. 

Another aspect of the present invention contemplates a method for modulating the 
expression of a genetic sequence wherein said sequence comprises an ORF having an RTG 
corresponding to an authentic translation initiation site of said ORF where R is A or G and 
20 a nucleotide sequence 5' of said authentic translation initiation site, said method 
comprising introducing or removing one or more RTG triplets in said 5* nucleotide 
sequence upstream of said authentic translation initiation site such that upon expression of 
said genetic sequence, there is a respective decrease or increase in the level of expression. 

25 Yet another aspect of the present invention further contemplates a method for modulating 
the expression of a genetic sequence wherein said sequence comprises an ORF having an 
RUG corresponding to a translation initiation site of said ORF where R is A or G and a 
nucleotide sequence 5' of said translation start site, said method comprising introducing or 
removing one or more RUG triplets in said 5' nucleotide sequence upstream of said 

30 authentic translation initiation site such that upon expression of said genetic sequence, 
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there is a respective decrease or increase in the level of expression. 

Still another aspect of the present invention contemplates, therefore, a method for 
facilitating increased or elevated expression of a genetic sequence wherein said sequence 
5 comprises an ORF having an RTG or RUG wherein R is A or G corresponding to an 
authentic translation initiation site of said ORF and a nucleotide sequence 5' of said 
authentic translation initiation site, said method comprising removing one or more RTG or 
RUG triplets in said 5' nucleotide sequence upstream of said authentic translation initiation 
site such that upon expression of said genetic sequence, there is an increase in the level of 
10 expression relative to expression of the genetic sequence in the absence of removal of any 
RTG or RUG triplet. 

Still yet another aspect of the present invention provides a method for facilitating increased 
or elevated expression of a genetic sequence wherein said sequence comprises an ORF 

1 5 having an RTG wherein R is A or G corresponding to an authentic translation initiation site 
of said ORF and a nucleotide sequence 5' of said authentic translation initiation site, said 
method comprising removing one or more RTG triplets in said 5' nucleotide sequence 
upstream of said authentic translation initiation site such that upon expression of said 
genetic sequence, there is an increase in the level of expression relative to expression of 

20 the genetic sequence in the absence of removal of any RTG triplet. 

Even yet another aspect of the present invention relates to a method for facilitating 
increased or elevated expression of a genetic sequence wherein said sequence comprises an 
ORF having an RUG wherein R is A or G corresponding to an authentic translation 
25 initiation site of said ORF and a nucleotide sequence 5' of said authentic translation 
initiation site, said method comprising removing one or more RUG triplets in said 5' 
nucleotide sequence upstream of said authentic translation initiation site such that upon 
expression of said genetic sequence, there is an increase in the level of expression relative 
to expression of the genetic sequence in the absence of removal of any RUG triplet. 



30 
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A further aspect of the present invention contemplates a method for facilitating decreased 
or reduced expression of a genetic sequence wherein said sequence comprises an ORF 
having an RTG or RUG where R is A or G corresponding to an authentic translation 
initiation site of said ORF and a nucleotide sequence 5' of said authentic translation 
5 initiation site, said method comprising introducing or creating one or more RTG or RUG 
triplets in said 5' nucleotide sequence upstream of said authentic translation initiation site 
such that upon expression of said genetic sequence there is a decrease in the level of 
expression relative to expression of the genetic sequence in the absence of introducing or 
removing any RTG or RUG triplets. 

10 

Another aspect of the present invention contemplates, therefore, a method for modulating 
the expression of a genetic sequence wherein said sequence comprises an ORF having an 
RTG or RUG corresponding to an authentic translation initiation site and a nucleotide 
sequence 5' of said authentic translation start site comprising the sequence:- 

15 

{n^.nJJx^^LlnX 

[x^^?Hn!^^..ni I '} s [y!^;^-]RT/UG[z 1 z 2 ...z n ] t 

wherein: 

20 RT/UG is the authentic translation initiation site and R is A or G; 

n, n 1 , n" and n 1 " are nucleotides selected from A, T or U, C or G or I; 
{n,n 2 ...n a } m , {n^ ...n[ } 0 , {tfn? .y } p and {nj"n»' ...n™ } s represent 

nucleotide sequences of a, b, c or d nucleotides in length and where each of n, n 1 , n 11 and 
n 111 may be the same or different and its position is indicated by the subscript numeral i, 2 , 

25 

[z,z 2 ...z n ] represents a translation termination signal within an authentic 
ORF but not in the same reading frame as said authentic ORF; 

each of m, n, o, p, q, r or s may be the same or different and each is 0 or 1 or 
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if there is a repeat or multiple repeats, from about 2 or about 10; 
t is 0, 1 or >1; 

each of [x^xj,,, [x|xjx^] p and [xj'xjxj], is selected from the triplet 

RTG, RUG, RYG, RTY 1 , RY Il G, RUY 111 , ATG, GTG, AUG and GUG where R is A or G, 
5 and each of Y, Y 1 , Y [l and Y 111 may be the same or different and each is a nucleotide with 
the proviso that Y is not T, Y 1 is not G, Y 11 is not U and Y HI is not G; 

each of [yly^y'], [yl'yj'y?] and lyWy?'] represents a translation 
termination signal; and 

said method comprising altering the nucleotide triplets [x[X 2 x 3 ] n , 

10 [xjxjxljp and/or [x|'x"x"] r to introduce or remove an RTG or RUG to thereby 
respectively decrease or increase the level of expression of said genetic sequence. 

Yet another aspect of the present invention provides a nucleotide sequence for use in 
modulating the expression of a genetic sequence wherein said genetic sequence comprises 
15 a coding region comprising a translation initiation site and optionally a 5' leader sequence 
such that said nucleotide sequence comprising a predetermined number of RTG or RUG 
triplets such that upon operable linkage to the 5' end of the genetic sequence, the level of 
expression of said genetic sequence is determined by the number of RTG/RUG triplets. 

20 Still another aspect of the present invention further provides a plurality of nucleotide 
sequences each comprising a predetermined number of RTG or RUG triplets wherein R is 
A or G wherein said predetermined number is from 0 to about 100 and preferably from 
about 0 to about 50 and even more preferably from about 0 to about 15 such that to 
facilitate expression of a genetic sequence to a particular level, a particular nucleotide 

25 sequence is selected and inserted or placed between a promoter and said genetic sequence 
such that the promoter is operably linked to said genetic sequence and wherein the level of 
expression is inversely functionally associated with the number of RTG or RUG triplets. 
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Still yet another aspect of the present invention is directed to an isolated nucleic acid 
molecule comprising a nucleotide sequence encoding a 5' mRNA sequence, said mRNA 
sequence selected from SEQ ID NO: 1 to SEQ ID NO: 5 or a nucleotide sequence having at 
least 60% similarity to one or more of SEQ ID NO:l to SEQ ID NO:5 or a nucleotide 
5 sequence capable of hybridizing to one or more of SEQ ID NO: 1 to SEQ ID NO:5 or a 
complementary form thereof under low stringency conditions wherein said nucleotide 
sequence is capable of influencing the level of expression of a genetic sequence operably 
linked to said nucleotide sequence. 

10 Even yet another aspect of the present invention provides an isolated nucleic acid molecule 
comprising a nucleotide sequence encoding a non-translated mRNA sequence, said 
nucleotide sequence selected from SEQ ID NO:33 to SEQ ID NO:36 or a nucleotide 
sequence having at least 60% similarity to one or more of SEQ ID NO:33 to SEQ ID 
NO:36 or a nucleotide sequence capable of hybridizing to one or more of SEQ ID NO:33 

15 to SEQ ID NO: 36 or complementary forms thereof under low stringency conditions 
wherein said nucleotide sequence is capable of influencing the level of expression of a 
genetic sequence operably linked to said nucleotide sequence. 

A further aspect of the present invention contemplates a genetic construct comprising a 
20 promoter linked to a genetic element and one or more restriction endonuc lease sites to 
facilitate insertion of a nucleotide sequence to be expressed by said promoter wherein said 
genetic element comprises a predetermined number of pseudo-translation initiation 
RTG/RUG triplets wherein R is A or G such that the level of expression of said nucleotide 
sequence by said promoter is inversely functionally associated with the number of 
25 RTG/RUG triplets in said genetic element. 

Another aspect of the present invention contemplates a method for modulating expression 
of a genetic sequence, wherein said genetic sequence comprises an ORF with an authentic 
translation initiation site and further comprising a sequence upstream of said authentication 
30 translation initiation site where said method comprises introducing, creating or removing 
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one or more pseudo-translation initiation triplets having the structure RWG wherein R is A 
or G and W is T or U in combination with introducing, creating or removing a Kozac or 
Kozac-like sequence proximal to said RWG such that the number of RWG triplets and 
Kozac or Kozac-like sequences is inversely functionally associated with expression of said 
5 genetic sequence. 

In all the above aspects, the genetic sequence may be further manipulated to introduce or 
remove termination signals thereby creating or destroying pseudo-ORPs within the genetic 
sequence. 

10 

Another aspect of the present invention provides an isolated nucleic acid molecule 
comprising a nucleotide sequence encoding a non-translated mRNA sequence, said 
nucleotide sequence selected from SEQ ID NO: 59 or 60 or a nucleotide sequence having at 
least 60% similarity to one or more of SEQ ID NO:59 or 60 or a nucleotide sequence 
15 capable of hybridizing to one or more of SEQ ID NO: 59 or 60 or complementary forms 
thereof under low stringency conditions wherein said nucleotide sequence is capable of 
influencing the level of expression of a genetic sequence operably linked to said nucleotide 
sequence. 
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A summary of sequence identifiers used throughout the subject specification is provided 
below. 



SUMMARY OF SEQUENCE IDENTIFIERS 



SEQUENCE ID 


DESCRIPTION 


NO: 




1 


Mct-UTR mRNA 


2 


Mp-UTR mRNA 


3 


My-UTR mRNA 


4 


HP-UTR mRNA 


5 


Hy-UTR mRNA 


6 


Synthetic 


7 


Synthetic 


8 


Synthetic 


9 


Synthetic 


10 


Synthetic 


11 


Synthetic 


12 


DNA Primer for mouse GLI exon 1 


13 


DNA Primer for mouse GLI exon 2 


14 


DNA Primer for mouse GLI exon 1 with restriction site for Nhel 


15 


DNA Primer for mouse GLI exon 2 with restriction site for Agel 


16 


DNA primer for mouse RACE 1 exon 4 


17 


DNA primer for mouse RACE2 exon 2/3 


18 


DNA primer for mGHFla exon la 


19 


DNA primer for mGliMFl 


20 


DNA primer for mGliMF2 


21 


DNA primer for mGliMF3 


22 


DNA primer for mGliMF4 


23 


DNA primer for mGHMRl 


24 


DNA primer for mGliMR2 


25 


DNA primer for mGliMR3 


26 


DNA primer for mGliMR4 


27 


DNA primer for mGliFl Ham 


28 


DNA primer for mGHR2 Bgl 


29 


DNA primer for mGliFl Nhe 


30 


DNA primer for mGliR2 Age 


31 


DNA Primer for human GLI exon 1 


32 


DNA Primer for human GLI exon 2 


33 


DNA Mouse GLI UTR partial genomic sequence 


34 


DNA Mouse GLI UTR partial genomic sequence 


35 


DNA Human GLI UTR partial genomic sequence 
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SEQUENCE ID 
NO: 


DESCRIPTION 


36 


r\\! A TT /~"« T T T [Tn i ■ 1 

DNA Human OL1 UTR partial genomic sequence 


37 


r\\i i ' £~ f T 1 ft P T 

DNA primer tor GL1 1 5 -Sad 


38 


DNA primer for OLI 3 -Ncol 


39 


Pnmer 


40 


Primer 


41 


Ma-UTR 


42 


Ma-UTR + 1 AUG 


43 


Ma-UTR + 2 AUG 


44 


Ma-UTR + 3 AUG 


45 


1 d T TP | 1T\ M AT 

Ma-UTR + 4 AUG 


46 


» J T TT^T"* i CAT T/^V 

Ma-UTR + 5 AUG 


47 


Mp UTR + 1 AUG 


48 


Mp UTR + 2 AUG 


49 


Mp UTR + 3 AUG 


50 


\ A 1 TTTl i 1 AT r 

My UTR + 1 AUG 


C 1 

5 1 


My U 1 K + 2 AUG 


52 


My UTR + 3 AUG 


53 


Hp UTR + 1 AUG 


54 


HP UTR + 2 AUG 


55 


Hp UTR + 3 AUG 


56 


Hy UTR + 1 AUG 


57 


Hy UTR + 2 AUG 


58 


Hy UTR + 3 AUG 


59 


Mouse GLI1 genomic sequence (exons in bold) 


60 


Human GUI genomic sequence (exons in bold) 
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BRIEF DESCRIPTION OF THE FIGURES 



Figure 1 is a representation of the sequence, exor/composition and pre-mRNA structure of 
v / the alternative 5'UTRs of mouse GUI, Pgiiel A, sequence alignment of the three 
5 alternative GUI 5'UTR variants (denoted ct-UTR, p-UTR and y-UTR) expressed in mouse. 
The novel 119 bp sequence of exon la is shown in bold lowercase lettering. The ATG 
codons denoting the beginning of uORFs are underlined and the main ORF encoding GUI 
is shown bold uppercase lettering. The intron/exon boundaries are indicated by arrows. 
Panel B, schematic showing the exon composition of the alternative 5'UTRs and the 
10 organization of the pre-mRNA from which they are derived. Exons are denoted by open 
boxes and introns by solid lines with intron size shown. The translation start site (ATG) of 
the main ORF is located in exon 2 and indicated by a bent arrow. 
/ 

Figure 2 is a photographic representation showing that expression of alternative GUI 
15 5'UTRs is not tissue specific but does show strain variation. RT-PCR was performed on 
mRNA isolated from brain (Br.), liver (Li.), Lung (Lu.), skin (Sk.), stomach (St.) and 
tongue (To.) tissues of a postnatal mouse (panel A). A signal for the p-UTR and y-UTR 
variants is also present in all tissues examined but barely discernible. Panel B, in BALB/c, 
DBA and C57B1/6 strains, the a-UTR variant is the major transcript (BALB/c is shown in 
20 lane 1), whereas the P-UTR variant is the dominant transcript in CD-I and SV129 strains 
(CD-I is shown in lane 2)The DNA marker (M) is also indicated. 



Figure 3 is a photographic representation showing that expression of the GUI 5'UTR 
variants is altered by TPA treatment. The expression of the y-UTR transcript is increased, 
25 whereas expression of the a-UTR is reduced, in TPA treated skin (+) relative to control 
skin (-). The DNA size marker is indicated (M). 

/ 

Figure 4 is a representation of the sequence, exon composition and pre-mRNA structure of 
alternative human GUI 5'UTRs. Panel A, sequence alignment of the alternative GUI 
30 5'UTR variants identified in human tissues (denoted P-UTR and y-UTR). The novel 144 bp 
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sequence of exon la is shown in bold lowercase lettering. The ATG codons denoting the 
beginning of uORFs are underlined and the main ORF encoding GL11 is shown in bold 
uppercase lettering. The intron/exon boundaries are indicated by arrows. Panel B, 
schematic showing the exon composition of the alternative 5'UTRs and the organization of 
5 the pre-mRNA from which they are derived. Exons are denoted by open boxes and introns 
^ by solid lines with intron size shown. The translation start site (ATG) of the main ORF is 
located in exon 2 and indicated by a bent arrow. Panel C, comparison of human exon 1 a 
sequences with mouse exon la. Identical bases are indicated with *. 

10 Figure 5 is a photographic representation of the detection of human GLI1 5'UTR variants 
by RT-PCR. RT-PCR results from skin, brain, the HaCaT human keratinocyte cell line 
(HaC), two BCC biopsies and a no-RNA control (dFbO) are shown. A DNA size marker 
(M) is also included. 

15 Figure 6 is a representation of the GLI1 5'UTR variants differentially express a GFP 
reporter construct in transfected cells. Panel A, a diagram showing the GFP constructs used 
in the transfection studies. The mouse GUI 5'UTR sequences were cloned upstream of the 
GFP ORF as indicated. pEGFP-Nl represents the parent GFP construct. Panel B, GFP 
fluorescence observed in Cos-1 cells transfected with the GFP expression vector alone 

20 (GFP), a-UTR- (a). P-UTR- (P) and y-UTR- (y) GFP constructs. Transfection efficiency 
was determined by counting GFP expressing cells, which revealed that each construct was 
transfected with the same efficiency (within experimental limits). 

Figure 7 is a representation of flow cytometry analysis of 5'UTR-GFP constructs in 
25 transfected cells. Panel A shows a fluorescence intensity histogram compiled from the 
analysis of twenty thousand cells per sample for each construct (see Figure 6) in 
transfected Cos-1 cells. The peak closest to the vertical axis is due to untransfected cells 
(negative cells) which show low fluorescent intensities due to autofluorescence. The 
second peak represents fluorescence from transfected cells that express GFP. The four 
30 histograms representing each GFP construct were merged to allow direct comparison of 
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fluorescence intensities. This analysis revealed that the y-UTR construct produced the 
highest GFP intensities (mean value 1.045 x 10 3 ) and the a-UTR the lowest (mean value 
6.2 x 10 1 ). Panel B, graphical representation of GFP intensities of the 5'UTR variants 
relative to the empty GFP vector (defined as 100) in transfected BHK-21 (BHK), Cos-1 
5 and HaCaT cells. Panel C, graphical representation of the ratios of GFP intensities 
between 5'UTR variants in the cell lines indicated. The y-UTR/a-UTR (■ y/a) and y- 
UTR/p-UTR (□ y/|3) ratios are shown for constructs transfected into BHK-21 (BHK), Cos- 
1, primary mouse skin fibroblasts (MSF) and HaCaT cells. The largest differences in GFP 
fluorescence between the y-UTR and the a- and p-UTR variants was observed in HaCaT 
10 cells. The values shown are the averages of three independent transfection and cell sorting 
experiments. 

Figure 8 is a graphical representation showing flow cytometry analysis of mutant 5TJTR- 
GFP constructs in transfected cells. A graphical representation of GFP intensities of wild- 
15 type a-UTR (a), mutant a-UTR (a mut ), wild-type y-UTR (y) and the 4mer y-UTR (y x4 ) 
constructs relative to the empty GFP vector in Cos-1 cells. The values shown are the 
averages of three independent transfection and cell sorting experiments. 

Figure 9 is a representation showing that increasing numbers of uORFs in a 5' leader 
20 sequence prior to a reporter gene decreased the level of expression represented as a 
percentage. 

Figure 10 is a diagrammatic representation of plasmid constructs (A) pSLJ4D4 and (B) 
P UQC1411. 

25 

Figure 11 is a diagrammatic representation of plasmid constructs (A) pUQC1451, (B) 
pUQC1461 and (C) pUQC1471. 

Figure 12 is a diagrammatic representation of plasmid constructs (A) pUQC1483, (B) 
30 P UQC1495,(C)pUQC1501 and (D) pUQC1511. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention is predicated in part on the elucidation that the level of expression of 
a genetic sequence is related to the presence and/or number of pseudo-translation initiation 
5 sites located 5', i.e. upstream, of an authentic translation initiation start site of an open 
reading frame (ORF) or coding sequence and the presence or absence of a Kozac or 
Kozac-like sequence genetically proximal to RTG triplets in DNA or RUG in RNA where 
R is adenine (A) or guanine (G). Manipulation of the nucleotide sequence 5' of the 
authentic translation initiation site to introduce, create or remove RTG/RUG triplets and/or 

10 to introduce Kozac or Kozac-like sequences provides a means of reducing or elevating, i.e. 
modulating, the level of expression of a transcript of the DNA in an eukaryotic cell such as 
an animal (e.g. insect, fish, amphibian), mammalian (e.g. human, primate, livestock animal 
or laboratory test animal) or lower eukaryotic cell.. The genetic sequence may be further 
manipulated to introduce or remove termination signals to create or destroy pseudo-ORFs 

15 within the genetic sequence. A pseudo-ORF may be located entirely within a 5'UTR or 
may extend into the coding region, referred to herein as the "authentic ORF" or "main 
ORF". In the case of the latter, there would necessarily be a stop codon not in the same 
reading frame as the authentic ORF. The pseudo-ORF may even extend into the 3'UTR. 

20 The term "expression" in this context particularly encompasses translation into a 
translation product and in particular a polymer of two or more amino acids. The presence 
or absence of RTG/RUG sites and/or Kozac or Kozac-like sequences also has diagnostic 
value in the prediction of the likely level of expression of a particular target gene. 
Furthermore, a plurality of genetic elements in the form of nucleic acid molecules may be 

25 produced each with differing levels of RTG/RUG triplets and genetically proximal Kozac 
or Kozac-like sequences and used to generate genetic constructs conferring defined levels 
or predetermined expectations of levels of expression of nucleotide sequences operably 
linked or fused to each of the nucleic acid molecules. 

30 Accordingly, the present invention contemplates a method for modulating the expression 
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of a genetic sequence wherein said sequence comprises an ORF having an RTG or RUG 
wherein R is A or G corresponding to an authentic translation site of said ORF and a 
nucleotide sequence 5' of said authentic translation start site, said method comprising 
introducing or removing one or more RTG or RUG triplets in said 5' nucleotide sequence 
5 upstream of said authentic translation initiation site such that upon expression of said 
genetic sequence, there is a respective decrease or increase in the level of expression. 

Reference herein to "modulating" means increasing or decreasing the level of expression 
of a genetic sequence. Generally, expression of a DNA sequence includes transcription to 

10 an mRNA sequence and optionally also translation of the transcript to a translation 
product, generally a polymer of two or more amino acids such as a peptide, polypeptide or 
protein. Expression of an RNA and in particular an mRNA sequence generally means 
translation into a translation product. Accordingly, expression covers transcription and/or 
translation and events including post-transcriptional and post-translational events. In a 

15 particularly preferred embodiment, expression is determined at the level of translation of 
an mRNA molecule. 

The term "authentic" translation initiation site is used to distinguish between an RTG or 
RUG, where R is A or G at the beginning of the authentic (i.e. main) ORF and RTG/RUG 

20 triplets upstream of the authentic site. The upstream RTG/RUG triplets are referred to 
herein as "pseudo-translation initiation sites" as they may have some role in ribosome 
binding at that site but this is not the correct site for translation of the authentic ORF. The 
term "pseudo-open reading frame" or "pseudo-ORF" means a translatable region between 
initiation and termination signals within the 5' leader sequence or beginning in the leader 

25 sequence but terminating within the coding region downstream of the authentic translation 
initiation site or in the 3' region. A termination signal in the main ORF would be in a 
different reading frame. 

The term "genetic sequence" includes a DNA or RNA and comprises an ORF having an 
30 RTG or RUG translation initiation site where R is A or G. This is regarded as the authentic 
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translation initiation site. The genetic sequence further comprises a nucleotide sequence 5' 
of the authentic translation initiation site. The 5' sequence may also be referred to as a 
"leader sequence", "upstream sequence", "5' non-coding sequence" or 5'UTR. The genetic 
sequence provides information necessary to direct the transcription of a DNA molecule or 
5 the translation of an mRNA molecule. The term "genetic sequence" includes nucleotide 
sequences defining exons, introns, terminators, promoters including enhancers and 
silencers and the coding and non-coding regions of an ORF or gene. Reference herein to an 
"mRNA" or "messenger RNA" refers to a ribonucleic acid molecule which is generated by 
the transcription of a DNA sequence. 

10 

In one embodiment, the genetic sequence is DNA. 

Accordingly to this embodiment, the present invention contemplates a method for 
modulating the expression of a genetic sequence wherein said sequence comprises an ORF 
15 having an RTG corresponding to an authentic translation initiation site of said ORF where 
R is A or G and a nucleotide sequence 5' of said authentic translation initiation site, said 
method comprising introducing or removing one or more RTG triplets in said 5' nucleotide 
sequence upstream of said authentic translation initiation site such that upon expression of 
said genetic sequence, there is a respective decrease or increase in the level of expression. 

20 

In another embodiment, the genetic sequence is RNA. 

Accordingly, the present invention further contemplates a method for modulating the 
expression of a genetic sequence wherein said sequence comprises an ORF having an RUG 
25 corresponding to a translation initiation site of said ORF where R is A or G and a 
nucleotide sequence 5* of said translation start site, said method comprising introducing or 
removing one or more RUG triplets in said 5' nucleotide sequence upstream of said 
authentic translation initiation site such that upon expression of said genetic sequence, 
there is a respective decrease or increase in the level of expression. 



30 
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The term "translation initiation site" or "initiation codon" refers to the RTG/RUG (R = A 
or G) that is selected as the initiating RTG/RUG for the translation of an RNA sequence 
comprising a translatable ORF. It is generally understood that genetic elements including 
nucleotides proximal to the translation initiation RTG or RUG provide part of the initiation 
5 information. Although the present invention extends to an RTG/RUG where R is A or G, 
the preferred R nucleotide is A. The translation initiation site may be "authentic", i.e. the 
correct start site of an ORF or it may be "pseudo" if located upstream of the authentic start 
site. 

10 The term "open reading frame" or "ORF" refers to a sequence of nucleotide triplets 
between start and stop codons, uninterrupted by internal stop codons. A "triplet" refers to a 
series of three nucleotides. Generally, an ORF encodes a peptide, polypeptide or protein. A 
genetic sequence to be expressed, in accordance with the present invention, is proposed to 
comprise an ORF and an upstream nucleotide sequence. The level of expression is 

15 proposed to be determined by the number of RTG/RUG triplets in this upstream sequence. 
Furthermore, the level of expression is proposed to be influenced by the number of pseudo- 
ORFs within the genetic sequence. These can be created or destroyed by introducing 
termination signals in the 5' leader sequence or, in a different reading frame, in the coding 
sequence downstream of the authentic translation initiation site or in the 3' region. 

20 

As stated above, reference herein to a "5' nucleotide sequence" refers generally to a 5' 
leader sequence, an upstream sequence (i.e. upstream of an ORF or authentic translation 
initiation site) or a 5 1 untranslated region or 5'UTR. The term "untranslated region" or 
"UTR" is a term of art referring to the particular location of a genetic sequence relative to 

25 the translation initiation site. However, the use of these terms is not to exclude the 
possibility that some partial translation may occur in this region. The 5' leader sequence is, 
therefore, a genetic element located 5' to the coding region of an mRNA transcript and 
provides the information necessary for correct translation initiation (i.e. the selection of a 
translation initiation site). The 5' leader sequence is further defined as that portion of an 

30 mRNA molecule which extends from the 5' cap site to the RUG translation initiation 
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codon. The term "cap" refers to the 7-methyl guanosine nucleotide structure present at the 
very 5' end of the mRNA transcript. 

The term "translation" refers to the multi-step biochemical pathway of translation. The 
5 regulation of translation and protein synthesis is the basis of cellular growth and 
differentiation. The term "translation" further relates to cap-dependent initiation and 
ribosome shunting cap. Dependent translation initiation is the major translation initiation 
pathway in eukaryotic cells. In this process, dissociated 40S ribosomal subunits "scan" an 
mRNA to identify and locate the start or initiation codon. Once an initiation complex has 

10 been formed between the 40S subunit and the 5' leader sequence of the mRNA, the 60S 
ribosomal subunit joins the initiation complex to form the 80S ribosome. This complex is 
then competent to initiate translation. Ribosome shunting is another pathway of translation 
initiation in which ribosomes bind to the mRNA in a cap-dependant manner, however, they 
are capable of "jumping" over large regions of the mRNA containing RNA secondary 

15 structure, upstream RUGs and upstream (uORFs) to a specific initiator RUG. The term 
"translation" encompasses all such mechanisms. 

Reference herein to "RTG" or "RUG" refers to a nucleotide triplet composed of 
deoxyribonucleotides and ribonucleotides, respectively. The symbol R referred to herein 
20 refers to a purine selected from the group of G and A. As stated above, A is the preferred R 
nucleotide although the present invention extends to A or G. 

By "DNA" or "RNA" is meant a sequence of two or more naturally occurring 
deoxyribonucleotides or ribonucleotides that are covalently bonded together. The 

25 deoxyribonucleotides or ribonucleotides may be naturally occurring (i.e. A, C, T, U, G) or 
non-naturally occurring including modified forms of the ribonucleotides. One example of a 
modified RNA included within this term is phosphorothioate RNA. The term "RNA" also 
encompasses RNA molecules comprising a series or mixture of modified nucleotide 
analogues. Such modified RNA molecules provide more stable substrates for translation 

30 when introduced in to a host cell. As used herein "oligonucleotide" or "oligomer" is 
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generic to polydeoxyribonucleotides (containing 2'-deoxy-D-ribose or modified forms 
thereof), i.e. DNA, to polyribonucleotides (containing D-ribose or modified forms thereof), 
i.e. RNA, and to any other type of polynucleotide which is an N-glycoside or C-glycoside 
of a purine or pyrimidine base, or modified purine or pyrimidine base. The present 
5 invention further contemplates DNA/RNA hybrids which may be transcribed by a DNA or 
RNA polymerase. 

The present invention provides, therefore, chemically modified nucleic acid molecules 
including nucleic acid molecules with chemically modified bases and/or inter-base 

10 linkages which may be introduced directly into a cell, such as by way of a linear strand or 
strands of nucleic acid molecules and then transcribed or translated by a cell's 
transcription/translation machinery. Alternatively, such nucleic acid molecules may be 
introduced via viral constructs. Such modified forms of nucleic acid molecules may be 
more stable especially to exonuclease or ribonuclease digestion notwithstanding a possible 

1 5 decrease in transcription or translation efficiency. 

In one embodiment, the modulation results in an increase or elevation in the levels of 
expression of a genetic sequence. This occurs by removing RTG or RUG sites in the 5* 
leader region of the genetic sequence. 

20 

Reference in this case to a 5' leader sequence "of the genetic sequence" includes the leader 
sequence naturally associated with the genetic sequence or a 5' sequence ligated to the 5' 
end of a genetic sequence. The genetic sequence may be considered, therefore, as 
comprising a heterologous 5' leader sequence or homologous 5* leader sequence depending 
25 on whether the 5' leader sequence is naturally associated with a genetic sequence (i.e. 
homologous) or is introduced (i.e. heterologous). 

Another aspect of the present invention contemplates, therefore, a method for facilitating 
increased or elevated expression of a genetic sequence wherein said sequence comprises an 
30 ORF having an RTG or RUG wherein R is A or G corresponding to an authentic 
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translation initiation site of said ORF and a nucleotide sequence 5' of said authentic 
translation initiation site, said method comprising removing one or more RTG or RUG 
triplets in said 5' nucleotide sequence upstream of said authentic translation initiation site 
such that upon expression of said genetic sequence, there is an increase in the level of 
5 expression relative to expression of the genetic sequence in the absence of removal of any 
RTG or RUG triplet. 

The term "authentic" is not to necessarily imply a naturally occurring start site although 
this is encompassed by the present invention. An authentic start site may be introduced but 
10 nevertheless is the translation start site for an ORF. Reference to "removing" one or more 
RTG or RUG triplets includes changing a single base to destroy a RTG/RUG triplet or 
deleting the triplet altogether or deleting a nucleotide sequence comprising or particularly 
containing the RTG/RUG triplet. Furthermore, an RTG/RUG triplet may be disrupted or 
removed by inserting a nucleotide sequence between an RT or RU or between TG or UG. 

15 

In one embodiment, the genetic sequence is DNA. 

Accordingly, the present invention provides a method for facilitating increased or elevated 
expression of a genetic sequence wherein said sequence comprises an ORF having an RTG 

20 wherein R is A or G corresponding to an authentic translation initiation site of said ORF 
and a nucleotide sequence 5' of said authentic translation initiation site, said method 
comprising removing one or more RTG triplets in said 5' nucleotide sequence upstream of 
said authentic translation initiation site such that upon expression of said genetic sequence, 
there is an increase in the level of expression relative to expression of the genetic sequence 

25 in the absence of removal of any RTG triplet. 

Removal of an RTG site is as defined above for the removal of an RTG or RUG. In both 
cases, the preferred R is A. 

30 In another embodiment, the genetic sequence is RNA. 
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Accordingly, the present invention relates to a method for facilitating increased or elevated 
expression of a genetic sequence wherein said sequence comprises an ORF having an RUG 
wherein R is A or G corresponding to an authentic translation initiation site of said ORF 
5 and a nucleotide sequence 5' of said authentic translation initiation site, said method 
comprising removing one or more RUG triplets in said 5' nucleotide sequence upstream of 
said authentic translation initiation site such that upon expression of said genetic sequence, 
there is an increase in the level of expression relative to expression of the genetic sequence 
in the absence of removal of any RUG triplet. 

10 

In another preferred embodiment, the modulation results in a decrease or reduction in the 
levels of expression. This occurs by introducing or creating RUG or RTG triplets in the 5' 
leader sequence region. 

15 Accordingly, the present invention contemplates a method for facilitating decreased or 
reduced expression of a genetic sequence wherein said sequence comprises an ORF having 
an RTG or RUG where R is A or G corresponding to an authentic translation initiation site 
of said ORF and a nucleotide sequence 5' of said authentic translation initiation site, said 
method comprising introducing or creating one or more RTG or RUG triplets in said 5' 

20 nucleotide sequence upstream of said authentic translation initiation site such that upon 
expression of said genetic sequence there is a decrease in the level of expression relative to 
expression of the genetic sequence in the absence of introducing or removing any RTG or 
RUG triplets. 

25 Again, the genetic sequence may be DNA or RNA. 

In the case of DNA, the present invention provides a method for facilitating decreased or 
reduced expression of a genetic sequence wherein said sequence comprises an ORF having 
an RTG where R is A or G corresponding to an authentic translation initiation site of said 
30 ORF and a nucleotide sequence 5' of said authentic translation initiation site, said method 



1- unit MM'I^I I ITU ; M.I 

-26- 

comprising introducing or creating one or more RTG triplets in said 5' nucleotide sequence 
upstream of said authentic translation initiation site such that upon expression of said 
genetic sequence there is a decrease in the level of expression relative to expression of the 
genetic sequence in the absence of introducing or removing any RTG triplets. 

5 

In the case that the genetic sequence is RNA, the present invention is directed to a method 
for facilitating decreased or reduced expression of a genetic sequence wherein said 
sequence comprises an ORF having an RUG where R is A or G corresponding to an 
authentic translation initiation site and a nucleotide sequence 5' of said authentic 
10 translation initiation site of said ORF, said method comprising introducing or creating one 
or more RUG triplets in said 5' nucleotide sequence upstream of said authentic translation 
initiation site such that upon expression of said genetic sequence there is a decrease in the 
level of expression relative to expression of the genetic sequence in the absence of 
introducing or removing any RUG triplets. 

15 

The introduction, creation or removal of an RUG or RTG triplet where R is A or G may be 
accomplished in any number of ways including site directed mutagenesis, homologous 
recombination or chemical or physical mutagenesis and may involve amplification 
reactions such a polymerase chain reactions with selected primers. For example, an RUG 

20 or RTG site may be created from an XUG or XTG triplet where X is neither A nor G by 
site directed mutagenesis to induce a substitution of X to an A or G. Similarly, mutagenesis 
may be used to convert RXG to RTG or RUG, RUX to RUG or RTX to RTG. 
Alternatively, short nucleotide sequences may be inserted or deleted to introduce or 
remove RTG or RUG triplets. Reference to removing an RTG or RUG triplet is not to be 

25 taken as necessarily deleting the three bases since nucleotide substitutions, deletions and/or 
additions may be induced to create or destroy a particular sequence. 

Again, these embodiments include the manipulation of termination signals to create or 
destroy pseudo-ORFs within the leader sequence or beginning in the leader sequence and 
30 ending in the coding sequence or in the 3' region after the coding sequence. The present 
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invention extends, therefore, to introducing or removing pseudo-ORFs within the 5' leader 
sequence and/or authentic ORF and/or the 3'UTR.. 

Site directed mutagenesis of a 5' leader sequence is one particularly useful method for the 
5 production of a variant having an RTG/RUG triplet created or removed. Site-directed 
mutagenesis of DN A is a method of altering a nucleotide sequence at one or more desired 
positions. Site-directed (i.e. site-specific) mutagenesis allows the production of sequence 
variants through the use of oligonucleotide sequences that encode a complementary DNA 
sequence comprising the desired mutation, as well as a sufficient number of adjacent 

10 nucleotides to provide a primer sequence of sufficient size and sequence complexity to 
form a stable duplex on both sides of the proposed mutation. Typically, a primer of about 
20 to 25 nucleotides in length is preferred, with about 5 to 10 residues on both sides of the 
proposed mutation of the sequence being altered. In general, the techniques of site-specific 
mutagenesis are well known in the art. As will be appreciated, the site-specific 

15 mutagenesis technique typically employs a phage vector that exists in both a single- 
stranded and double-stranded form. Typical vectors useful in site-directed mutagenesis 
include vectors such as the M 1 3 phage, for example, or other phage vectors which are 
commercially available. Their use is well known to those skilled in the art. Plasmid vectors 
which contain a single-stranded phage origin of replication may be employed to obtain 

20 single-stranded DNA. Alternatively, nucleotide substitutions are introduced by 
synthesizing the appropriate DNA fragment in vitro and amplifying it by PCR procedures 
known in the art. 

A mutagenic PCR technique may also be used in creating 5' leader sequence variants. In a 
25 specific example of PCR mutagenesis, template plasmid DNA comprising a 5' leader 
sequence is linearized by digestion with a restriction endonuclease which has a unique 
recognition site in the plasmid DNA outside of the region to be amplified. Of this material, 
an aliquot is added to a PCR mixture containing PCR buffer, which contains the four 
deoxyribonucleotide triphosphates and is included in the GeneAmp (trade mark) kits 
30 (obtained from Perkin-Elmer Cetus, Norwalk, Conn, and Emeryville, Calif), and each 
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oligonucleotide primer. The PCR buffer contains divalent manganese ions which acts to 
increase the frequency at which the thermostable polymerase incorporates an incorrect 
nucleotide. This molecule then directs synthesis of a copy of itself in a subsequent round of 
PCR. As the reaction proceeds, a plurality of mutant sequences derived from the template 
5 5' leader sequence accumulate in the reaction. The reaction mixture is then overlayered 
with mineral oil. The reaction is denatured for at, for example, 100°C, and placed briefly 
on ice and then Thermus aquaticus (Taq) DNA polymerase, purchased from Perkin-Elmer 
Cetus, Norwalk, Conn, and Emeryville, Calif.) is added below the mineral oil layer. The 
reaction mixture is then inserted into a DNA Thermal Cycler (purchased from Perkin- 

10 Elmer Cetus). The Cycler may be programmed as appropriate to the conditions. One 
suitable programme is 2 min at 55°C, 30 sec at 72°C, then 19 cycles of the following: 30 
sec at 94°C, 30 sec at 55°C and 30 sec at 72°C. At the end of the program, the reaction 
tube is removed from the thermal cycler and the aqueous phase transferred to a new tube, 
extracted with phenol/chloroform (50:50 vol), and ethanol precipitated, and the DNA is 

15 recovered by standard procedures. This material is subsequently subjected to appropriate 
treatments for insertion into a vector. Individual clones arising from the PCR mutagenesis 
reaction can be sequenced to isolate individual nucleic acid molecules that contain the 
desired mutations. 

20 Random mutagenesis of 5' leader nucleotide sequences may also be accomplished by 
several different techniques known in the art, such as by altering sequences within 
restriction endonuclease sites, inserting an oligonucleotide linker randomly into a plasmid, 
by irradiation with X-rays or ultraviolet light, by incorporating incorrect nucleotides during 
in vitro DNA synthesis, by error-prone PCR mutagenesis, by preparing synthetic mutants 

25 or by damaging plasmid DNA in vitro with chemicals. Chemical mutagens include, for 
example, sodium bisulfite, nitrous acid, hydroxylamine, agents which damage or remove 
bases thereby preventing normal base-pairing such as hydrazine or formic acid, analogues 
of nucleotide precursors such as nitrosoguanidine, 5-bromouracil, 2-aminopurine, or 
acridine intercalating agents such as proflavine, acnflavine, quinacrine, and the like. 
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Generally, plasmid DNA or DNA fragments are treated with chemicals, transformed into 
E. coli and propagated as a pool or library of mutant plasmids. 

As stated above, expression of a genetic sequence is modulated by introducing or creating 
5 or removing particular pseudo-translation initiation sites. This may be visualized by using 
a schematic representation of an upstream, i.e. 5', nucleotide sequence. One such schematic 
representation is provided below. 

The present invention contemplates, therefore, a method for modulating the expression of a 
10 genetic sequence wherein said sequence comprises an ORF having an RTG or RUG 
corresponding to an authentic translation initiation site and a nucleotide sequence 5 1 of said 
authentic translation start site comprising the sequence:- 

[x!«x%n r {n;^r-.-n^ s k , yry? , ]RT/UG[z 1 z 2 „.z n ] t 

15 

wherein; 

RT/UG is the authentic translation initiation site and R is A or G; 

n, n 1 , n 11 and n'"are nucleotides selected from A, T or U, C or G or I; 
20 {n,n 2 ...n a }„, {nX.Xlc {","""••<}, and ^n™ ...n™ } s represent 

nucleotide sequences of a, b, c or d nucleotides in length and where each of n, n 1 , n 11 and 
n IM may be the same or different and its position is indicated by the subscript numeral i, 2> 
■ • * ■> 

[ziZ 2 ...z n ] represents a translation termination signal within an authentic 
25 ORF but not in the same reading frame as said authentic ORF; 

each of m, n, o, p, q, r, s or c may be the same or different and each is 0 or 1 
or if there is a repeat or multiple repeats, from about 2 or about 10; 
tisO, 1 or>l; 
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each of [x,x 2 x 3 ] tl , [x|x 2 x'] p and [x|'x"xj] r is selected from the triplet 

RTG, RUG, RYG, RTY 1 , RY M G, RUY m , ATG, GTG, AUG and GUG where R is A or G, 
and each of Y, Y 1 , Y 11 and Y 111 may be the same or different and each is a nucleotide with 
the proviso that Y is not T, Y l is not G, Y 11 is not U and Y 111 is not G; 
5 each of |y!y 2 y.'], [y " y" y" ] and |y j" y" 1 y" 1 ] represents a translation 

termination signal; and 

said method comprising altering the nucleotide triplets [x,x 2 x 3 ] n , 

[x|x 2 x 3 ] p and/or [x"x"xj ] r to introduce or remove an RTG or RUG to thereby 
respectively decrease or increase the level of expression of said genetic sequence. 

10 

Preferred termination signals are selected from but are not limited to TGA, TAA, TAG, 
UGA, UAA and UAG. 

When the genetic sequence is a DNA, the modulation involves introducing or removing an 
15 RTG. 

When the genetic sequence is RNA, the modulation involves introducing or removing an 
RUG. 

20 In this method, the [x,x 2 x 3 ] n , [x[x 2 x 3 ] p and [x " x " x " ] r triplets are mutagenized by a 

nucleotide substitution, deletion and/or addition or is chemically or physically mutagenized 
to introduce or create or remove an RTG or RUG site where R is A or G. Preferably, R is 
A. 

25 One skilled in the art will appreciate that the 5' nucleotide sequence may be longer or 
shorter than represented above and may include more potential pseudo-translation 
initiation sites than represented above. The 5' sequence as represented serves as a guide to 
the general structure. The essence of this aspect of the present invention is the 
manipulation of the 5' sequence to introduce, create or remove RTG or RUG triplets 
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wherein R is A or G. 

An RTG/RUG triplet in the 5' leader sequence is regarded as a pseudo-translation initiation 
site. The nucleotide sequence between an upstream RTG/RUG triplet and a termination 
5 signal is regarded as a pseudo-ORF (also referred to as an upstream ORF [uORF]). 

The essence of this aspect of the present invention is the manipulation of the number of 
pseudo-translation initiation sites and optionally pseudo-ORFs. Although the effects on 
expression is difficult to quantify given the inherent variabilities of a biological system, 
10 semi-quantification is possible and provides useful input data for an algorithm or other data 
processing means. 

Conveniently, the invention may be represented as a nucleotide sequence having the 
sequence :- 

15 

n x n x +i . . . n x + z 

wherein: 

n x is the first nucleotide in a leader sequence; 
20 x is 1 or>l (e.g. 100, 1000, 10,000 or greater); 

i is 1; 

z is an integer from 1 to 10; 

n x + z is the last nucleotide of the 5' leader sequence prior to the authentic 
translation initiation site; 

25 

wherein each n may be the same or different and each is A, C, G, U, T or I; 
wherein a numerical value (Nv) is assigned to a genetic element such that if:- 



30 



n, as defined above; 
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n x +i+i = A; 

n x+j +2 = Tor U; and 

n x +j+3 = G 

5 then the N v is 1; 

when 

n x +i 

10 n x+ j+] 
n x +i+3 
then the N v is 0.3; 

15 

and when:- 

n x +i 
n x +i+i 

20 n.x+i+2 

n x +i+3 

then the Nv is 0; 

25 such that the level of expression (E L ) of a nucleotide sequence operably linked at its 5' end 
to n x n x +i --. n x + 7 is inversely functionally associated (*) to the sum of Nv determined from 
the nucleotide sequence n x n x+i . . . n x+7 such that 

E *-L_ 

1 ZN V 



n, as defined above; 
G; 

T or U; and 
G 



n as defined above; 
CorG; 

T or C or G; and 
A, T or C 
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In this case, the sum is calculated from nucleotide x to nucleotide x+z . The present invention 
provides data processing means, therefore, to establish a pattern of likelihood of expression 
of a genetic sequence based on the RTG/RUG content of the 5' sequence upstream of an 
5 authentic translation initiation site. 

In one particularly useful embodiment, the 5' leader sequence is derived from a GUI gene 
leader sequence. For example, SEQ ID NO:l comprises the sequence of the 5' leader 
sequence from murine GLI1 and is termed Ma-UTR. This sequence comprises five ATG 
10 sites interspersed in a sequence of nucleotides prior to the authentic ATG site. Two splice 
variants, Mp-UTR and My-UTR (SEQ ID NOS:2 and 3, respectively) carry two and one 
ATG sites, respectively. 

The level of expression directed by these three UTRs, as defined by the expression 
15 E L , Ma-UTR has an SN V of 5.6, MP-UTR is 2.3 and My-UTR is 0.3. Hence, the 

I v is expected to be Ma-UTR>Mp-UTR>My-UTR. This correlates well with the 
experimental results obtained on the levels of translation products. 

Similarly, human GLI1 comprises two 5 r UTRs, namely Hp-UTR (SEQ ID NO:4) and Hy- 
20 UTR (SEQ ID NO:5) with ZNv of 3.6 and 0, respectively. Again, the level of expression 
conferred by these two UTRs is defined by Hy-UTR>Hp~UTR and this correlates well 
with the experimental findings. 

Consequently, deletion of part of the 5'UTR containing RTG/RUG sites results in elevated 
25 expression of the GLI 1 coding region in both murine and human systems. 

The y sequence of My-UTR and Hy-UTR is particularly useful, not only for a lack of 
pseudo-translation inhibition sites and pseudo-ORFs but also due to inherent properties in 
facilitating higher level expression of authentic ORFs operably linked thereto. The present 
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invention, in a particularly preferred embodiment, is directed to the use of My-UTR or Hy- 
UTR sequence to facilitate expression of an authentic nucleotide sequence attached 
thereto. 

5 The present invention extends to maintaining the same length of 5' leader sequence but 
altering the numbers of RTGs or RUGs. Alternatively, the 5' leader sequences may be 
reduced in length or increased in length by deletion or insertion of particular nucleotides. 

The present invention, therefore, provides nucleotide sequences comprising a 
10 predetermined number of RTG/RUG triplets for use in inserting into the 5' leader sequence 
of a genetic sequence or for inserting between a coding sequence and a promoter region. 

Accordingly, another aspect of the present invention provides a nucleotide sequence for 
use in modulating the expression of a genetic sequence wherein said genetic sequence 
15 comprises a coding region comprising a translation initiation site and optionally a 5' leader 
sequence such that said nucleotide sequence comprising a predetermined number of RTG 
or RUG triplets such that upon operable linkage to the 5' end of the genetic sequence, the 
level of expression of said genetic sequence is determined by the number of RTG/RUG 
triplets. 

20 

In one form, the RTG/RUG sites in the 5' leader region are regarded as pseudo-translation 
initiation sites wherein the number of such triplets is inversely functionally associated to 
the level of expression, i.e. the higher the number of RTG/RUG triplets, the lower the level 
of expression. 

25 

In another form, the level of expression of a genetic sequence is determined by the number 
of elements defined by the sequence:- 

[nRWGn] m 

30 where: 
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n is any nucleotide or sequence of nucleotides and wherein the sequence is 
defined by njn? ... n n n n+ ] and n is from about 1 to about 1000, preferably from about 2 to 
about 500, more preferably from about 3 to about 100 and even more preferably from 
5 about 4 to about 50; 

R is G or A; 

W is T or U; and 

m is from 0 to about 100, or from 0 to about 50 or from 0 to about 20 or 
from 0 to about 10 and represents the polydispersion of the nRWGn sequence throughout 
10 the 5' leader sequence. 

In a similar embodiment, adecrease in the level of expression may occur when a 5' leader 
sequence which comprises the sequence:- 

15 [nX,X 2 X,n] m 
wherein: 

n and m are as defined above and the triplet X1X2X3 is selected from Rn[G , 
n[TG , n\ UG , RTn^ , RUnJ where R is A or G and n 1 is any nucleotide with the proviso 
20 that nj is not T or U, nj, or n^ is not A and n [ 4 and 115 is not G; and 

wherein said sequence is mutated to cause a substitution or other mutation 
in n to change same to provide the triplet RTG or RUG where R is as defined above. 

The present invention further provides a plurality of nucleotide sequences each comprising 
25 a predetermined number of RTG or RUG triplets wherein R is A or G wherein said 
predetermined number is from 0 to about 100 and preferably from about 0 to about 50 and 
even more preferably from about 0 to about 15 such that to facilitate expression of a 
genetic sequence to a particular level, a particular nucleotide sequence is selected and 
inserted or placed between a promoter and said genetic sequence such that the promoter is 
30 operably linked to said genetic sequence and wherein the level of expression is inversely 



-36- 

functionally associated with the number of RTG or RUG triplets upstream of the authentic 
translation initiation site. 

Such nucleotide sequences include one or more oligonucleotide sequences such as an 
5 oligonucleotide sequence with a predetermined number of pseudo-translation initiation 
sites and optionally pseudo-ORFs. 

In a specific embodiment, the plurality of nucleotide sequences is selected from a 5' leader 
sequence from the GLI1 gene. 

10 

Accordingly, another aspect of the present invention is directed to an isolated nucleic acid 
molecule comprising a nucleotide sequence encoding a 5' mRNA sequence, said mRNA 
sequence selected from SEQ ID NO: I to SEQ ID NO:5 or a nucleotide sequence having at 
least 60% similarity to one or more of SEQ ID NO:l to SEQ ID NO:5 or a nucleotide 
15 sequence capable of hybridizing to one or more of SEQ ID NO:l to SEQ ID NO:5 or a 
complementary form thereof under low stringency conditions wherein said nucleotide 
sequence is capable of influencing the level of expression of a genetic sequence operably 
linked to said nucleotide sequence. 

20 As stated above, SEQ ID NOS:l to 3 correspond to Ma-UTR, Mp-UTR and My-UTR, 
respectively. SEQ ID NOS:4 and 5 correspond to H(3-UTR and Hy-UTR, respectively. 
Partial genomic sequences corresponding to M-UTR are shown in SEQ ID NOS:33 and 
34. Partial genomic sequences corresponding to H-UTR are shown in SEQ ID NOS:35 and 
36. Complete genomic sequences for M-UTR and H-UTR are shown in SEQ ID NOS: 59 

25 and 60, respectively. 

Accordingly, the present invention provides an isolated nucleic acid molecule comprising a 
nucleotide sequence encoding a non-translated mRNA sequence, said nucleotide sequence 
selected from SEQ ID NO:33 to SEQ ID NO:36 or a nucleotide sequence having at least 
30 60% similarity to one or more of SEQ ID NO:33 to SEQ ID NO:36 or a nucleotide 
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sequence capable of hybridizing to one or more of SEQ ID NO:33 to SEQ ID NO:36 or 
complementary forms thereof under low stringency conditions wherein said nucleotide 
sequence is capable of influencing the level of expression of a genetic sequence operably 
linked to said nucleotide sequence. 

5 

Another aspect of the present invention provides an isolated nucleic acid molecule 
comprising a nucleotide sequence encoding a non-translated mRNA sequence, said 
nucleotide sequence selected from SEQ ID NO: 59 or 60 or a nucleotide sequence having at 
least 60% similarity to one or more of SEQ ID NO:59 or 60 or a nucleotide sequence 
10 capable of hybridizing to one or more of SEQ ID NO: 59 or 60 or complementary forms 
thereof under low stringency conditions wherein said nucleotide sequence is capable of 
influencing the level of expression of a genetic sequence operably linked to said nucleotide 
sequence. 

15 The term "influencing" includes increasing or decreasing (i.e. modulating) the level of 
expression depending on which 5'UTR is employed. The My-UTR and Hy-UTR are 
particularly useful in facilitating expression of genetic sequences as attached thereto. 

The term ''similarity" as used herein includes exact identity between compared sequences 
20 at the nucleotide level. Where there is non-identity at the nucleotide level, "similarity" 
includes differences between sequences which result in different amino acids that are 
nevertheless related to each other at the structural, functional, biochemical and/or 
conformational levels. In a particularly preferred embodiment, nucleotide sequence 
comparisons are made at the level of identity rather than similarity. 

25 

Terms used to describe sequence relationships between two or more polynucleotides 
include "reference sequence", "comparison window", "sequence similarity", "sequence 
identity", "percentage of sequence similarity", "percentage of sequence identity", 
"substantially similar" and "substantial identity". A "reference sequence" is at least 5 but 
30 frequently 12 to 18 and often at least 25 or above, such as 30 monomer units, inclusive, of 
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nucleotides. Because two polynucleotides may each comprise (1) a sequence (i.e. only a 
portion of the complete polynucleotide sequence) that is similar between the two 
polynucleotides, and (2) a sequence that is divergent between the two polynucleotides, 
sequence comparisons between two (or more) polynucleotides are typically performed by 
5 comparing sequences of the two polynucleotides over a "comparison window" to identify 
and compare local regions of sequence similarity. A ''comparison window 1 ' refers to a 
conceptual segment of typically 5 nucleotides that is compared to a reference sequence. 
The comparison window may comprise additions or deletions (i.e. gaps) of about 20% or 
less as compared to the reference sequence (which does not comprise additions or 

10 deletions) for optimal alignment of the two sequences. Optimal alignment of sequences for 
aligning a comparison window may be conducted by computerized implementations of 
algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software 
Package Release 7.0, Genetics Computer Group, 575 Science Drive Madison, WI, USA) or 
by inspection and the best alignment (i.e. resulting in the highest percentage homology 

15 over the comparison window) generated by any of the various methods selected. Reference 
also may be made to the BLAST family of programs as for example disclosed by Altschul 
et ah (36). A detailed discussion of sequence analysis can be found in Unit 19.3 of Ausubel 
et ah (37). 

20 The terms "sequence similarity" and "sequence identity" as used herein refers to the extent 
that sequences are identical or functionally or structurally similar on a nucleotide-by- 
nucleotide basis over a window of comparison. Thus, a "percentage of sequence identity", 
for example, is calculated by comparing two optimally aligned sequences over the window 
of comparison, determining the number of positions at which the identical nucleic acid 

25 base (e.g. A, T, C, G, I) occurs in both sequences to yield the number of matched positions, 
dividing the number of matched positions by the total number of positions in the window 
of comparison (i.e., the window size), and multiplying the result by 100 to yield the 
percentage of sequence identity. For the purposes of the present invention, "sequence 
identity" will be understood to mean the "match percentage" calculated by the DNASIS 

30 computer program (Version 2.5 for windows; available from Hitachi Software engineering 
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Co., Ltd., South San Francisco, California, USA) using standard defaults as used in the 
reference manual accompanying the software. Similar comments apply in relation to 
sequence similarity. 

5 Reference herein to a low stringency includes and encompasses from at least about 0 to at 
least about 15% v/v formamide and from at least about 1 M to at least about 2 M salt for 
hybridization, and at least about 1 M to at least about 2 M salt for washing conditions. 
Generally, low stringency is at from about 25-30°C to about 42°C. The temperature may 
be altered and higher temperatures used to replace formamide and/or to give alternative 

10 stringency conditions. Alternative stringency conditions may be applied where necessary, 
such as medium stringency, which includes and encompasses from at least about 16% v/v 
to at least about 30% v/v formamide and from at least about 0.5 M to at least about 0.9 M 
salt for hybridization, and at least about 0.5 M to at least about 0.9 M salt for washing 
conditions, or high stringency, which includes and encompasses from at least about 31% 

15 v/v to at least about 50% v/v formamide and from at least about 0.01 M to at least about 
0.15 M salt for hybridization, and at least about 0.01 M to at least about 0.15 M salt for 
washing conditions. In general, washing is carried out T m = 69.3 + 0.41 (G+C)% (38). 
However, the T m of a duplex DNA decreases by 1°C with every increase of 1% in the 
number of mismatch base pairs (39). Formamide is optional in these hybridization 

20 conditions. Accordingly, particularly preferred levels of stringency are defined as follows: 
low stringency is 6 x SSC buffer, 0.1% w/v SDS at 25-42°C; a moderate stringency is 2 x 
SSC buffer, 0.1% w/v SDS at a temperature in the range 20°C to 65°C; high stringency is 
0.1 x SSC buffer, 0.1% w/v SDS at a temperature of at least 65°C. 

25 The present invention further provides a genetic construct comprising a genetic element 
having a predetermined number of pseudo-translation initiation sites, i.e. RTG/RUG 
triplets. Generally, the genetic element is in a vector construct operably linked at its 5' end 
to a promoter and comprising one or more unique or semi-unique restriction endonuclease 
sites within a nucleotide which is part of or linked to the genetic element. These sites are 

30 used to insert a nucleotide sequence to be expressed. 
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The present invention extends to a plurality of genetic constructs each with a genetic 
element within a predetermined number of pseudo-translation sites or a plurality of genetic 
elements for use in insertion into a genetic construct. 

5 

Accordingly, another aspect of the present invention contemplates a genetic construct 
comprising a promoter linked to a genetic element and one or more restriction 
endonuclease sites to facilitate insertion of a nucleotide sequence to be expressed by said 
promoter wherein said genetic element comprises a predetermined number of pseudo- 
10 translation initiation RTG/RUG triplets wherein R is A or G such that the level of 
expression of said nucleotide sequence by said promoter is inversely functionally 
associated with the number of RTG/RUG triplets in said genetic element. 

The genetic construct may be a plasmid, vector, viral construct, linear or covalently closed 
1 5 circular molecule. 

The nucleotide sequence to be expressed is also referred to as a coding sequence or ORF. 

As used herein, a "construct" is a nucleic acid molecule into which a desired nucleotide 
20 sequence may be inserted by restriction and ligation. Such construct is useful for transport 
between different genetic environments or for expression in a host cell. Constructs are 
typically composed of DNA although RNA constructs are also available. The constructs of 
the present invention are generally for eukaryotic cells such as animal, mammalian and 
plant cells. Constructs include, but are not limited to, plasmids, vectors and phagemids. A 
25 cloning construct is one which is able to replicate in a host cell, and which is further 
characterized by one or more endonuclease restriction sites at which the construct may be 
cleaned in a determinable fashion and into which a desired nucleotide sequence may be 
ligated generally so that the new recombinant construct retains its ability to replicate in a 
host cell. In the case of plasmids, replication of the desired sequence may occur many 
30 times as the plasmid increases in copy number within a host bacterium or just a single time 
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per host before the host reproduces by mitosis. In the case of phage, replication may occur 
actively during a lytic phase or passively during a lysogenic phase. An expression vector or 
construct is one into which a desired nucleotide sequence may be inserted by restriction 
and ligation such that it is operably joined to regulatory sequences and may be expressed 
5 as an RNA transcript. Vectors may further contain one or more marker sequences suitable 
for use in the identification of cells which have or have not been transformed or transfected 
with the vector. Markers include, for example, genes encoding proteins which increase or 
decrease either resistance or sensitivity to antibiotics or other compounds, genes which 
encode enzymes whose activities are detectable by standard assays known in the art (e.g. 
10 (3-galactosidase or alkaline phosphatase) and genes which visibly affect the phenotype of 
transformed or transfected cells, colonies or plaques. Preferred vectors are those capable of 
autonomous replication and expression of the structural gene products present in the DNA 
segments to which they are operably joined. 

15 As used herein, a coding sequence and regulatory sequences are said to be "operably" 
joined or linked when they are covalently linked in such a way as to place the expression 
or transcription of the coding sequence under the influence or control of the regulatory 
sequences. If it is desired that the coding sequences be translated into a functional protein, 
two nucleotide sequences are said to be operably joined if induction of a promoter in the 5' 

20 regulatory sequences results in the transcription of the coding sequence and if the nature of 
the linkage between the two nucleotide sequences does not (1) result in the introduction of 
a frame-shift mutation, (2) interfere with the ability of the promoter region to direct the 
transcription of the coding sequences, or (3) interfere with the ability of the corresponding 
RNA transcript to be translated into a protein. Thus, a promoter region would be operably 

25 joined to a coding sequence if the promoter region were capable of effecting transcription 
of that DNA sequence such that the resulting transcript might be translated into the desired 
polypeptide or protein. 

The precise nature of the regulatory sequences needed for gene expression may vary 
30 between species or cell types, but shall in general include, as necessary, 5', i.e. 5' non- 
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coding and 5'UTR non-transcribing and 5' non-translating sequences involved with the 
initiation of transcription and translation respectively, such as a TATA box, capping 
sequence, CAAT sequence, and the like. Especially, such 5' non-transcribing regulatory 
sequences will include a promoter region which includes a promoter sequence for 
5 transcriptional control of the operably joined gene that comprises a 5' untranslated leader 
sequence. Regulatory sequences may also include enhancer sequences or upstream 
activator sequences as desired. 

The constructs of the present invention are directed to providing a means of tailoring the 

10 expression of a nucleic acid molecule. Accordingly, the present invention contemplates a 
construct which contains one or more suitable restriction sites for insertion of a particular 
5' leader or signal sequence or 5' UTR which contains none or one or more pseudo- 
translation initiation sites each of which may or may not be proximal to a Kozac sequence 
or a Kozac-like sequence. The choice and design of an appropriate construct is within the 

1 5 ability and discretion of one of ordinary skill in the art. Vector construction is well-known 
to those skilled in the art. For instance, the DNA manifestation of an RNA virus, such as 
HIV, is cleaved using restriction enzymes to excise HIV encoding sequences from within 
the gag coding region, following the nef gene. A cloning cassette comprised of a polylinker 
containing multiple restriction sites is inserted into the region of the deletion prior to 

20 ligation to provide convenient restriction sites for cloning into the vector. A DNA fragment 
containing the 5' leader sequence containing the number of pseudo-translation initiation 
sites is then sub-cloned into one of these sites. Subsequently, an ORF may be inserted 
down stream of the polylinker. It is envisaged that suitable restriction enzyme sites for the 
cloning of an ORF and simultaneous construction of a translation initiation RTG would be 

25 Ncol or NdeL However, other restriction enzymes sites are also contemplated. The 
resultant vector will transcribe an mRNA with a 5' untranslated leader sequence capable of 
regulating the expression of the open reading frame to which the 5' leader sequence is 
operably linked. The level of expression will be is inversely functionally associated with 
the number of pseudo-translation initiation sites present in the 5* untranslated leader 

30 sequence. 
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Examples of promoters useful to practice the present invention, especially in the 
production of a genetic vaccine for humans, include but are not limited to promoters from 
Simian Virus 40 (SV40), Mouse Mammary Tumor Virus (MMTV) promoter, Human 
5 Immunodeficiency Virus (HIV) such as the HIV Long Terminal Repeat (LTR) promoter, 
Moloney Virus, ALV, Cytomegalovirus (CMV) such as the CMV immediate early 
promoter, Epstein Barr Virus (EBV), Rous Sarcoma Virus (RSV) as well as promoters 
from human genes such as human actin, human myosin, human hemoglobin, human 
muscle creatine and human metallothionein. 

10 

Promoter elements contemplated herein include but are not limited to MM_JNA2, 
MM_INA4, MMJNA5, MM_INA6, MM_INA7, HSJNAl, HS_INA6, HS_INA5, 
HS_INAD, HSJNAF, HS__INA4, HSJNAG, HSJNA7, HS_INB, MM_INB, HS_ING, 
HCMVJEI, MCMV_VIEI, HCMV_UL37, HCMV_US3, HSVl_ICPO, HSV1_IE63, 

15 HSV2_IE63, HSV1_ICP4, HSV2_IE3, HSVME68, HSV2JE45, HCMV_POLB, 
HCMVB22, HCMV_B27, HSVl_EXON, HSV1JJL49, HSV1_DUT, HSV1_B33, 
HSV1_B21, HSV1_RIR1, HSV1JUR2, HSV2_RIR2, HSV1_KITH, HSV2_KITH, 
HSV1_DNBI, HSV1_VGLB, HSV1_VGLD, HSV2_VGLI, HSV1_VGLE HSV1_PRTP, 
HSV1_VGLG, HSV1_TEGP, HSV1_VCAP, HSV1_UL11, HSV1_ATIN, HSV1_ATI2, 

20 HCMV_UL36, HCMVJ3P65, HCMV_GP71and HCMV_G150, HSV1_G5, HSV_VGLC, 
HSV2_VGLC, HSV1_VGLH, HSV1_G42, HSVl_ORIS, HSV1_UL45, HSV2JJL45, 
EBV_BAM, EBV_VP14, EBV_BCRF, EBV_BHL1, EBVJJL34, EBV_VP26, 
EBV_BFL2, EBV„UL73, EBV_YLR2, EBV_VGP3, EBV_YLR3, EBV_DUT, 
EBV__VGLL, EBV_13KB, EBV_HL1, EBV_CLI, EB V_DR 1 , EBV_DL2, EB V_DL I A, 

25 HSV_TYSY, EB V_EBNA_1 , EBV_EBNA_2, EB V_DL 1 , HS_PGK1, HS_CCEM, 
HS_CGM6, HIV1_LTR, HIV2_LTR, SRVl.LTR, SIV_LTR, SV40_TA_1, JCV_TL, 
POLY_COAT_l, SV40_TA_2, BKV_TA, POLY_TA_2, SV40_COAL, POLY_COAL, 
HBV_CORA_, GSHV_35KB_1, HBV_CORA_2, GSHV_35KB_2, DHBV_33KB, 
HBV_22_1, GSHV_23_1, HBV_22_2, GSHV_23_2, DHBV_18KB, BPV1_PL_1, 

30 BPV1_PL_2, BPV1_P79_1, HP18_VE6_1, BPV1_P89_2, HP16_VE, HP18_VE6_2, 
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BPV1_P24, BPV1_P30, AD2_E 1 A, AD5JE1A, AD7_E1 A, AD12_E1A_1, 
AD12_E1A_2, SA7P_E1A, AD2_E1B. AD5_E1B, AD7_E1B, AD12_E1B, AD2_V33P 
AD2__E3, AD5_E312, AD2_E411, AD2_PIV2, AD5_PIV2, AD7_4A2, AD2_9, AD5_9, 
AD7_HEX9, AD2__DNBI, AD5_DPOL, AD7_Y1, AD12_ML, AD2_L2A, MMLV_LTR, 
5 FMCF_LTR, MLVA_AKV, MLV_XENO, MMSV_LTR, GALV_LTR, SSV_LTR, 
GAFE_LTR, MMTV_PR73, BLV_LTR, HTV 1_LTR, HTL2_LTR, RSV_LTR, 
ALV_LTR, RAV2_LTR, GG_EV1, SNV_LTR, AAV2_VNCA, AAV2_19, 
AAV2_COA3, H1_VNCS, Hl_COAT and B19_06. 

10 Examples of polyadenylation signals useful to practice the present invention, especially in 
the production of a genetic vaccine for humans, include but are not limited to, SV40 
polyadenylation signals and LTR polyadenylation signals. In particular, the SV40 
polyadenylation signal which is in pCEP4 plasmid (Invitrogen, San Diego, California), 
referred to as the SV40 polyadenylation signal, is used. 

15 

In addition to the regulatory elements required for DNA expression, other elements may 
also be included in the DNA molecule. Such additional elements include enhancers. The 
enhancer may be selected from the group including but not limited to human actin, human 
myosin, human hemoglobulin, human muscle creatine and viral enhancers such as those 
20 from CMV, RSV and EBV. 

Genetic constructs can be provided with mammalian origin of replication in order to 
maintain the construct extrachromosomally and produce multiple copies of the construct in 
the cell. Plasmids pCEP4 and pREP4 from Invitrogen, (San Diego, California) contain the 
25 Epstein Barr virus origin of replication and nuclear antigen EBNA-1 coding region which 
produces high copy episomal replication without integration. 

The genetic construct may be administered with or without the use of microprojectiles. It is 
preferred that the genetic constructs of the present invention may be delivered to the cells 
30 of an individual free of solid particles. As used herein, the phrase "free of solid particles" 
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is meant to refer to a liquid that does not contain any solid microprojectiles used as a 
means to perforate, puncture or otherwise pierce the cell membrane of a cell in order to 
create a port of entry for genetic material into the cell. 

5 The genetic constructs of the present invention may be used to administer an individual 
where it may be necessary to modulate the expression of, for example, of a particular 
antigen. This may be useful in inducing immunization conditions. In particular, modulated 
expression may be appropriate when delivering genetic sequences capable of expressing 
antigens derived from pathogens such as viruses, prokaryotes and pathogenic eukaryotic 

10 organisms such as unicellular pathogenic organisms and multi-cellular parasites. The 
present invention is particularly useful to regulate the immune response when over- 
expression of antigens may lead to an inappropriate immune response such as in an 
inappropriate histamine release or anaphylactic shock. In addition, the present invention is 
also useful to immunize an individual against protozoan pathogens which include a stage 

15 in the life cycle where they are intracellular pathogens. As used herein, the term 
" intracellular pathogen" is meant to refer to a virus or pathogenic organism that, at least of 
its reproductive or life cycle, exists within a host cell and therein produces or causes to be 
produced, pathogen proteins. 

20 The present invention also contemplates the use of a 5 1 leader sequence to regulate the 
expression of genetic sequences which encode therapeutic agents including peptides, 
polypeptides or proteins including, as stated above, genetic material which encodes 
peptides, polypeptides or protein that are antigens for immunization. Nucleic acid 
molecules of the present invention are introduced into the cells of an animal. In this aspect, 

25 the present invention may require that the high level of uptake and function of the nucleic 
acid molecules. The method of the present invention further comprises the steps of 
administering nucleic acid molecules, preferably free of viral particles, particularly 
retroviral particles, to the cell of an individual. Administration may also be in conjunction 
with a genetic vaccine facilitator agent. The genetic vaccine facilitator agent is selected 

30 from the group consisting of anionic lipids, extracellular matrix-active enzymes, saponins, 
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lectins, estrogenic compounds and steroidal hormones, hydroxylated lower alkyls, 
dimethyl sulfoxide (DMSO) and urea. The genetic vaccine facilitator agent preferably 
enhances the inflammatory response and/or enhances expression of the nucleic acid 
molecule in the tissue and/or facilitates the uptake of the nucleic acid molecule by the cell. 

5 

When taken up by a cell, the genetic construct which includes the nucleotide sequence 
encoding the desired protein operably linked to the regulatory elements herein described 
may remain present in the cell as a functioning extrachromosomal molecule or it may 
integrate into the cell's chromosomal DNA. DNA may be introduced into cells where it 

10 remains as separate genetic material in the form of a plasmid. Alternatively, linear DNA 
which can integrate into the chromosome may be introduced into the cell. When 
introducing DNA into the cell, reagents which promote DNA integration into 
chromosomes may be added. DNA sequences which are useful in promoting integration 
may also be included in the DNA molecule. Alternatively, RNA may be administered to 

15 the cell. It is also contemplated to provide the genetic construct as a linear 
minichromosome including a centromere, telomeres and an origin of replication. 

The present invention is particularly useful for the generation of genetically modified 
plants. Plants may be monocotyledonous or dicotyledonous. 

20 

Suitable methods for introduction of genetic material into cells include transformation 
using CaCl 2 and variations thereof, direct DNA uptake into protoplasts, PEG-mediated 
uptake to protoplasts, microparticle bombardment, electroporation, microinjection of 
DNA, microparticle bombardment of tissue explants or cells, vacuum-infiltration of tissue 
25 with nucleic acid and T-DNA-mediated transfer from Agrobacterium to the plant tissue. 

Agrobacterium-mediaied transformation of suitable plants may be effected by co- 
cultivating an explant to be transfected with Agrobacterium species having a T-DNA or T- 
DNA region comprising the genetic material to be transformed into the plant cells, for a 
30 time and under conditions sufficient for the genetic material to transfer into the plant cells. 
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The Agrobacteriwn species may be either high or low virulent strain. A particularly useful 
Agrobacterium species is Agrobacteriwn tumefaciens strain AGLO (45), AGL1 (45), strain 
ICMP 8317 (46), strain EHA101 (47), strain LBA4404 (48) and strain C58 (49). 

5 For microparticle bombardment of cells, a microparticle is propelled into a cell to produce 
a transformed cell. Any suitable ballistic cell transformation methodology and apparatus 
can be used in performing the present invention. Exemplary apparatus and procedures are 
disclosed by Stomp et ai (U.S. Patent No. 5,122,466) and Sanford and Wolf (U.S. Patent 
No. 4,945,050). When using ballistic transformation procedures, the genetic construct may 
10 incorporate a plasmid capable of replicating in the cell to be transformed. 

Examples of microparticles suitable for use in such systems include 0. 1 to 10 urn and more 
particularly 10.5 to 5 \im tungsten or gold spheres. The genetic construct may be deposited 
on the microparticle by any suitable technique, such as by precipitation. 

15 

The genetic transformation and regeneration methods of the present invention may be 
employed to confer on a plant cell specifically-desired traits such as, for example, 
resistance to attack by disease-causing pathogenic agents, modified fatty-acid content, 
improved quality and the capability to synthesize novel, high value and value-added 
20 products and to induce different levels of resistance to herbicides and pesticides. 

Expression of genetic sequences in plants may also be required to be tissue or 
developmental ly specific. Suitable promoters or other regulatory sequences may be 
selected to ensure tissue or developmental specificity. 

25 

In accordance with one particularly preferred embodiment of the present invention, a 
genetic construct comprising a desired trait may be incorporated into a plasmid capable of 
replicating in a plant cell, coated onto gold or tungsten microparticles by, for example, 
precipitation and bombarded into embryogenic callus to produce transformed callus cells, 
30 capable of being regenerated into transgenic plantlets. The transformed embryogenic callus 
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cells of the present invention are selected under any one of a number of suitable selective 
agents well known to those skilled in the art. Alternatively, the gold or tungsten 
microparticles may be bombarded into immature embryos from which selected 
transformed embryogenic callus, capable of being regenerated into transgenic plantlets, is 
5 induced to form. 

To facilitate identification of transformed cells, the embryogenic callus is bombarded with 
a further genetic construct, comprising a selectable or screenable marker gene. The actual 
choice of a marker is not crucial as long as it is functional (i.e. selective) in combination 
10 with the plant cells of choice. The marker gene and the gene of interest do not have to be 
linked, since co -transformation of unlinked genes as, for example, described in U.S. Patent 
No. 4,399,216 is also an efficient process in plant transformation. 

Included within the terms "selectable or screenable marker genes" are genes that encode a 
15 "secretable marker" whose secretion can be detected as a means of identifying or selecting 
for transformed cells. Examples include markers that encode a secretable antigen that can 
be identified by antibody interaction or secretable enzymes that can be detected by their 
catalytic activity. Secretable proteins include, but are not restricted to, proteins that are 
inserted or trapped in the cell wall (e.g. proteins that include a leader sequence such as that 
20 found in the expression unit of extensin or tobacco PR-S); small, diffusible proteins 
detectable, for example, by ELISA; and small active enzymes detectable in extracellular 
solution such as, for example, a-amylase, P-lactamase, phosphinothricin acetyltransferase). 

Examples of bacterial selectable markers are the dal genes from Bacillus suhtilis or 
25 Bacillus licheniformis , or markers that confer antibiotic resistance such as ampicillin, 
kanamycin, erythromycin, chloramphenicol or tetracycline resistance. Exemplary 
selectable markers for selection of plant transformants include, but are not limited to, a hyg 
gene which encodes hygromycin B resistance; a neomycin phosphotransferase (neo) gene 
conferring resistance to kanamycin, paromomycin, G418 and the like as, for example, 
30 described by Potrykus et al. (50); a glutathione-S-transferase gene from rat liver conferring 
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resistance to glutathione derived herbicides as, for example, described in EP-A 256 223; a 
glutamine synthetase gene conferring, upon overexpression, resistance to glutamine 
synthetase inhibitors such as phosphinothricin as, for example, described WO87/05327, an 
acetyl transferase gene from Streptomyces viridochromogenes conferring resistance to the 
5 selective agent phosphinothricin as, for example, described in EP-A 275 957, a gene 
encoding a 5-enolshikimate-3-phosphate synthase (EPSPS) conferring tolerance to N- 
phosphonomethylglycine as, for example, described by Hinchee et al. (51), a bar gene 
conferring resistance against bialaphos as, for example, described in W09 1/02071; a 
nitrilase gene such as bxn from Klebsiella ozaenae which confers resistance to bromoxynil 
10 (52); a dihydrofolate reductase (DHFR) gene conferring resistance to methotrexate (53); a 
mutant acetolactate synthase gene (ALS), which confers resistance to imidazolinone, 
sulfonylurea or other ALS-inhibiting chemicals (EP-A-0 154 204); or a mutated 
anthranilate synthase gene that confers resistance to 5-methyl tryptophan. 

15 Preferred screenable markers include, but are not limited to, a uidA gene encoding a P- 
glucuronidase (GUS) enzyme for which various chromogenic substrates are known; a p- 
galactosidase gene encoding an enzyme for which chromogenic substrates are known; an 
aequorin gene (54), which may be employed in calcium-sensitive bioluminescence 
detection; a green fluorescent protein gene (55); a luciferase (luc) gene (56), which allows 

20 for bioluminescence detection; a p-lactamase gene (57), which encodes an enzyme for 
which various chromogenic substrates are known (e.g. PADAC, a chromogenic 
cephalosporin); an R-locus gene, encoding a product that regulates the production of 
anthocyanin pigments (red colour) in plant tissues (58); an a-amylase gene (59); a 
tyrosinase gene (60) which encodes an enzyme capable of oxidizing tyrosine to dopa and 

25 dopaquinone which in turn condenses to form the easily detectable compound melanin; or 
a xylE gene (61), which encodes a catechol dioxygenase that can convert chromogenic 
catechols. 

As will also be appreciated by a person skilled in the relevant art, media components 
30 suitable for effecting the formation of transformed polyembryogenic callus from 
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transformed embryogenic callus are many and varied. Such components may include but 
are not limited to sugars, amino acid and vitamin supplements, and growth promoting 
hormones such as auxins and cytokines. Sometimes, an improvement in the speed or 
efficacy of the desired outcome may be achieved by manipulation of these components and 
5 the concentrations at which they are present. In accordance with the present invention, 
preferred components have been found to include an auxin, such as naphthalene acetic 
acid, indole acetic acid or indole butyric acid. 

In one embodiment, the auxin is indole butyric acid. In a particular embodiment, indole 
10 butyric acid is present in the medium at a concentration in the range 1-10 uM, preferably 
3-8 uM and even more preferably 4-6 (iM. Those skilled in the art will appreciate that the 
concentration may be varied somewhat and still be effective for the desired purpose. 

The method of the present invention is initiated by obtaining an explant from a plant, 
15 generally in the form of leaf or root tissue or an immature embryo. After sterilization, the 
explant is transferred to callus induction medium, comprising for example, MS salts and 
Y3 vitamins, supplemented appropriately, and incubated at 28°C in the dark. For leaf and 
root explants, supplements include myo-Inositol, L-glutamine, sucrose, activated charcoal, 
agar and an auxin, such as for example, 4-dichlorophenoxy-acetic acid (2,4-D). For 
20 immature embryo explants, supplements include myo-Inositol, L-glutamine, sucrose, 
NaFeEDTA and, preferably, thiamine or coconut water. Coconut water, at a concentration 
of 50-150 ml/1 and more preferably 100ml/l, is a particularly-preferred supplement. 

For leaf and root explants, incubation continues for a period of about 6-8 weeks or until the 
25 beginning of production of embryogenic calli, following which the cultures are sub- 
cultured every 1 2 weeks. In the case of immature embryo explants, incubation continues 
with sub-culturing every 3-4 weeks, for up to 12 weeks or until embryogenic calli were 
produced. 



-51 - 



Embryogenic calli are maintained on embryogenic medium, comprising, for example, MS 
salts, macro- and micro-nutrients and Y3 vitamins supplemented with myo-Inositol, L- 
glutamine, L-asparagine, sucrose and auxins, such as for example, 10 jaM 2,4-D and 5\xM 
naphthalene acetic acid (NAA), and are incubated at 28°C in the dark. Embryogenic calli 
5 are sub-cultured every 30 days into fresh nutrient medium. During maintenance of said 
embryogenic callus cultures, distinct morphological Types (I, II and III) of embryogenic 
callus are obtained. 

Type I, Type II or Type III embryogenic callus, and preferably Type II embryogenic 
10 callus, is subjected to transformation via microparticle bombardment of gold particles, 
having diameter of preferably 0.1-3.0 microns, more preferably 0.5-2.0 microns and still 
more preferably 0.8-1.2 microns, onto which has been precipitated a desired-DNA- 
containing genetic construct. Gold particles are bombarded into preferably 0.1-2.0 grams 
wet weight of embryogenic callus, more preferably 0.3-1.0 gram wet weight of 
15 embryogenic callus, and still more preferably 0.5-0.7 gram wet weight of embryogenic 
callus. 

The putatively-transformed embryogenic callus cultures are then subjected to selection 
under an appropriate selection environment. For example, selection of transformed cells 

20 may be achieved using a variety of chemical agents, such as antibiotics (e.g. hygromycin 
and/or geneticin (G418)) or herbicides (e.g. BASTA (trademark)). Use of these agents may 
require the inclusion of a selectable marker gene, such as but not limited to the hph gene 
for hygromycin resistance or the nptll gene for geneticin or kanamycin resistance, and the 
bar gene for BASTA (trademark) resistance. These genes render the transformed cells 

25 resistant to the selection agent. 



One particularly useful selection medium comprises BASTA (13.5% PPT) at 
concentrations of about 35-45 ppm, and preferably 40 ppm, to about 75-85 ppm and 
preferably 80 ppm, or at gradually increasing concentrations of from about 5-15 ppm, and 
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preferably 10 ppm, to about 15-25 ppm, and preferably 20 ppm, and finally to about 35-45 
ppm, and preferably 40 ppm at 3 weeks post-bombardment. 

Following selection, transformed embryogenic callus cultures are generally transferred to 
5 polyembryogenic-inducing medium, comprising for example, MS salts, macro- and micro- 
nutrients and Y3 vitamins supplemented with myo-Inositol, L-glutamine, L-asparagine, L- 
arginine, sucrose and agar, and an auxin, such as for example, 5 |iM indole butyric acid 
(IBA), in which they are cultured for generally at least about 3-6 months, more usually at 
least about 3-4 months and preferably at least 4 months, with sub-culturing into fresh 
10 medium generally every 20-40 days, more particularly every 25-35 days and preferably 
about every 30 days, until the formation of green-coloured poly embryogenic cultures. 

Transformed poly embryogenic cultures are finally transferred to cells an appropriate 
shoot-inducing medium, such as, for example, MS salts, macro- and micro-nutrients and 

15 Y3 vitamins, supplemented with myo-inositol, L-glutamine, L-asparagine, L-arginine, 
sucrose and agar, and an auxin such as for example 0.1 uM NAA. Normally, shoot 
induction requires at least about 2-4 months, and more usually at least about 3 months. 
Propagation of shoots generally requires an auxin, such as NAA. Production of roots 
generally requires a further auxin, such as for example 2, 4-D, in addition to NAA. Other 

20 possibly suitable auxins include indole acetic acid (IAA) and IBA. Accordingly, roots are 
induced in medium supplemented as above, but with the addition of 10 ^.M 2,4-D, 70 uM 
NAA, and activated charcoal. After incubation at 28°C for at least about 2 months, the 
plamiets are then transferred to soil. 

25 The present invention provides, therefore, a genetically modified plant cell or genetically- 
modified and regenerated multicellular plant or progeny thereof or parts of said transgenic 
plant exhibiting altered levels of expression of a target gene. 

The method of the present invention is also particularly useful for up-regulating or down- 
30 regulating the function of a promoter. In some circumstances, a promoter may be too 
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active. The introduction of pseudo-translation initiation sites or pseudo-ORFs between the 
promoter and the authentic translation initiation site is a useful mechanism for reducing 
expression. 

5 The effects of introducing or creating pseudo-translation initiation sites and in particular 
RTG/RUG triplets may be enhanced or otherwise modulated by creating or removing or 
modifying Kozac or Kozac-like sequences. A Kozac-like sequence includes a modified 
Kozac sequence. A modified Kozac sequence includes a weak or strong Kozac sequence. 
A Kozac sequence comprises the nucleotides:- 

10 

RnCCRWGn [SEQ ID NO:6] 

wherein: 

15 RisAorG; 

n is any nucleotide; and 
W is U or T. 

An example of a strong Kozac sequence is:- 

20 

GCCRCCRWGG [SEQ ID NO:7] 

whereas an example of a weak Kozac sequence is represented as:- 

25 ATTTCCRWGn [SEQ ID NO:8]. 

Accordingly, an existing RWG site where R is A or G and W is T or U may be made more 
efficient is a pseudo-translation initiation site by manipulating Kozac or Kozac-like 
sequences proximal to said pseudo-translation initiation triplet to either create or destroy a 
30 Kozac or Kozac-like sequence or to change from a weak to strong or strong to weak Kozac 
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sequence. 

For example, 5' leader sequence may include the nucleotides: - 
5 ATTTCCTTGA [SEQ ID NO:9]. 

A first manipulation is to create an ATG site resulting in the sequence: - 

ATTTCCATGA [SEQ ID NO: 10]. 

10 

The ATG site is located within a weak Kozac sequence. This sequence may be 
manipulated to render it a strong Kozac sequence such as by substituting "ATTT" with 
"GCCA/G", to create the sequence: - 

15 GCCA/GCCATGA [SEQ ID NO: 1 1]. 

The first A after the ATG may also be changed to a G. 

The effect of such a modification is to enhance the pseudo-translation initiation activity of 
20 the introduction ATG triplet thereby increasing the down-regulating effects of the 
introduction of the pseudo-translation inhibiting triplet. 

Accordingly, another aspect of the present invention contemplates a method for 
modulating expression of a genetic sequence, wherein said genetic sequence comprises an 

25 ORF with an authentic translation initiation site and further comprising a sequence 
upstream of said authentication translation initiation site where said method comprises 
introducing, creating or removing one or more pseudo-translation initiation triplets having 
the structure RWG wherein R is A or G and W is T or U in combination with introducing, 
creating or removing a Kozac or Kozac-like sequence proximal to said RWG such that the 

30 number of RWG triplets and Kozac or Kozac-like sequences is inversely functionally 
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associated with expression of said genetic sequence. 



Preferably, in one embodiment, the manipulation results in a sequence comprising:- 



RnCCRWGn 



wherein: - 



R is A or G; 

n is any nucleotide; and 

W is T or U. 



The present invention is further described by the following non-limiting Examples. 
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EXAMPLE 1 

Reverse transcriptase-PCR (RT-PCR), 5* RACE and PCR 

Skin, brain, heart, kidney, liver, lung, muscle, stomach, spleen, testis and tongue were 
5 obtained from various strains of neonatal, juvenile and adult mice. Total RNA was isolated 
from these tissues using TRI Reagent (Molecular Research Center). First-strand cDNA 
was synthesized from 5 fig of total RNA primed with oligo(dT)| 6 (Perkin Elmer) or 
random hexamers (Clontech) using superscript (trade mark) II Reverse Transcriptase (Life 
Technologies) in a total volume of 20 ul One-tenth by volume of the cDNA was used as 

10 the template for subsequent PCR reactions. Primer pairs (listed in Table 1) for mouse GLI1 
corresponding to sequences within exon 1 (mGHFl) and exon 2 (mGliR2) and for human 
GLI1 corresponding to exon 1 (hGliFl) and exon 2 (hGliR2) sequences were used. Each 
PCR reaction was repeated at least three times with different RNA preparations and 
included negative controls for each set of reactions. For 5' RACE, the inventors used a 

15 cDNA template that was generated using the primer RACE1 and C-tailed using terminal 
transferase according to the manufacturer's protocol (Life Technologies). PCR was 
performed using the RACE anchor and adapter primers (Life Technologies) and the GLI1- 
specific nested primer RACE2. Mouse genomic DNA was amplified using primers derived 
from exon la (mGliFla) and exon 2 (mGliR2) sequences. All PCR products were 

20 separated on 0.8-2% w/v agarose gels and visualized with ethidium bromide. Separated 
fragments were purified using QIAEX (registered trade mark) H (QIAGEN) and sequenced 
directly using the Big Dye-termination Kit and automated fluorescent sequencing on an 
ABI-Prism 377 DNA Sequencer (Perkin Elmer). 

25 EXAMPLE 2 

TP A treatment of mouse skin 

Neonatal and 7 day old mice (Swiss outbred) were treated with 20-50 p.1 of 100 u.g/ml 12- 
O-tetradecanoylphorbol 13-acetate (TPA) (Sigma) topically applied to back skin. Mice 
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were sacrificed at different time points (0 hr, 3 hr, 8 hr and 24 hr) post-application and total 
skin RNA prepared for RT-PCR as described above. 
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EXAMPLE 3 

S'-UTR-GFP constructs and functional analysis of 5'UTRs 

Each of the three alternative 5'UTRs of mouse GLI1 were generated by RT-PCR using 
5 primers mGliFl Nhe and mGliQR2 Age (Table 2) that contain Nhel and Agel restriction sites, 
respectively. PCR products were gel-purified and cloned into the pGEM-T Easy Vector 
(Promega). The four ATG codons of the aUTR were mutated sequentially using the 
QuikChange (trade mark) Site-Directed Mutagenesis kit (Stratagene) and complementary 
primers mGHMfl-4 [SEQ ID NOS: 19-22] and mGlilMrl-4 [SEQ ID NOS:23-26] (Table 

10 2). The yUTR sequence was multimerized using primers mGliFl Bam [SEQ ID NO:27] and 
mGLiR Bgl [SEQ ID NO:28] that contain BamHi and Bghl sites. The amplified fragment 
was cloned into pGEM-T Easy, sequence and the insert released by digestion with Bam HI 
and BgHl. The purified fragment was ligated in the presence of both restriction enzymes to 
form similarly oriented concatomers that were then used as templates for PCR 

15 amplification using mGLiFl Nhe [SEQ ID NO:29] and mGliR2 Age [SEQ ID NO:30]. The 
products of this reaction were sized on an agarose gel and the band corresponding to four 
copies of the yUTR cloned into the pGEM-T Easy Vector. All inserts were verified by 
sequence analysis, released from the pGEM-T Easy Vector by restriction digestion with 
Nhel and Agel and subcloned into the corresponding sites of the Green Fluorescence 

20 Protein (GFP) expression vector, pEGFP-Nl (Clontech). 

HaCaT, a human heratinocyte cell line (33), Cos-1 (40) and BHK-21 (41) cells were 
cultured in Dulbecco's modified Eagle's medium (DMEM) supplemented with 10% v/v 
fetal calf serum, ampicillin and streptomycin (Life Technologies). Primary mouse skin 

25 fibroblasts were obtained from newborn Swiss mice using an established protocol. Briefly, 
skin was removed, washed in PBS and incubated in 2.5% w/v dispase (Life Technologies) 
for 24 hr at 4°C. The dermis was separated from the epidermis and incubated in 0.2% w/v 
collagenase (Sigma) at 37°C for one hr. Cells were then pelleted and washed in PBS and 
cultured in DMEM media as described above. Primary fibroblasts were used within the 

30 first two weeks of culturing. 
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Transient transfection of GFP constructs was performed using LipofectAMINE PLUS 
Reagent (Life Technologies) according to the manufacturer's instruction. Cells were 
seeded on round, glass coverslips in 24-well (fluorescence microscope study) or 6-well 
5 plates (flow cytometry study) 24 hr prior to transfection and incubated at 37°C to a density 
of about 50-70% confluence. Cultures were washed twice in serum-free media and then 
incubated in DNA-PLUS-LipofectAMINE complexes in OPTI-MEM (Life Technologies) 
for three hr at 37°C. DMEM media containing serum was then added to the culture. One 
day later, cells were fixed for microscopy or harvested flow cytometry analysis. Twenty 
10 thousand cells per sample were analyzed on a FACSCalibur (Becton Dickinson) cell sorter, 
using CELLQuest software (Becton Dickinson). 

EXAMPLE 4 
Identification of alternative mouse GLI1 5'UTRs 

15 

In order to identify the transcriptional start site of the mouse GLI1 gene, 5'RACE was 
performed on total skin RNA from a BALB/c mouse and the resulting PCR product 
directly sequenced. This analysis showed that sequences around the translation start codon 
and at the beginning of the transcript were identical to the published mouse GLI1 sequence 

20 obtained from F9 cells (42). However, the RACE product also contained an additional 1 19 
bp, not present in the published sequence, located at the splice junction between exon II 
and exon III as numbered by Liu et al. (42). The inventors named this new 5'UTR 
variant,a-UTR and the published sequence as p-UTR (Figure 1A). To search for other 
possible splicing variants, RT-PCR was conducted using a forward primer located at exon 

25 I and a reverse primer that hybridizes immediately 5' of the GLI1 ATG codon (Table I). 
This PCR revealed a much smaller product which was sequenced directly and found to 
consist of only one 5' non-coding exon. This alternative mouse GUI variant corresponds 
in both size and sequence to the published human transcript obtained from a glioma cell 
line (13,42) and was denoted the y-UTR variant (Figure 1 A). 



30 
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In order to determine how these alternative transcripts were generated, the inventors 
characterized the genomic organization of this region. Mouse genomic DNA was amplified 
using the pnmer pair mGliFla [SEQ ID NO:18] and mGHR2 [SEQ ID NO:13] (Table 1). 
Sequence analysis of the 2.8 kb product identified the novel 119 nucleotide sequence 
5 within the a-UTR variant as an authentic exon (which the inventors denoted as exon lb) 
that is flanked by 2.5 kb (intron la) and 115 bases (intron lb) of intervening sequences 
(Figure IB). The identification of this additional non-coding exon required a change in the 
nomenclature used by the earlier study of Liu et al (42), with exon II becoming exon la 
and exon III (which encodes the translation start site) renumbered as exon 2 (see Figure 
10 IB). 



EXAMPLE 5 
Expression of mouse GLI1 5'UTRs 



1 5 The expression of the alternative 5'UTR variants was determined in neonatal, juvenile and 
adult tissues from various mouse strains by RT-PCR (Figure 2). This analysis revealed that 
the UTR variants had no particular tissue-specific expression pattern, but did show marked 
strain-specific differences. In all tissues examined, the larger UTR forms (a and p-UTR) 
predominate while the y-UTR appears as a minor amplified product (Figure 2A). In some 

20 strains, such as BALB/c. DBA and C57BL/6, the a-UTR variant was the major form, 
whereas in CD1 and SV129 strains, the p-UTR form was the predominant transcript 
(Figure 2B). In Swiss outbred mice, expression of the a-UTR and p-UTR was 
heterogeneous with some animals expressing both forms and others expressing only one. 
In a given individual, the expression profile of the two larger transcripts was identical in all 

25 tissues examined irrespective of the strain used (Figure 2A). The inventors also followed 
the expression of the UTR variants in postnatal skin development and found that the 
apparent levels of all GUI transcripts were reduced with increasing age and that the y- 
UTR transcript was not detected at all in adult skin. 



-61 - 

To evaluate whether the expression of these 5'UTR variants correlated with proliferative 
status, newborn and 7 day old animals were treated with TPA topically applied to back 
skin. These experiments revealed that expression of the larger UTR transcripts was 
maximally reduced at 3 hr post-application, whereas expression of the y-form was 
5 increased (Figure 3). Reduced expression of the a- and p-UTRs was still evident, albeit not 
as marked, 24 hr post-application). Since acute TPA treatment results in increased mitotic 
activity of the basal layer keratinocytes (44), these data indicate an association of the y- 
UTR transcript with proliferation and the a- and p-UTRs with differentiation. 

10 EXAMPLE 6 

Identification of alternative human GLll 5'UTRs 

The inventors searched for alternative human GLll transcripts in newborn foreskin by RT- 
PCR with primers derived from exon 1 and exon 2 (Table 1) of the published sequence 

15 (13). Two PCR products were generated and sequenced, with the smaller fragment 
corresponding to the published sequence (13) and the larger containing an additional 144 
bases located at the splice junction of exon 1 and exon 2 (Figure 4A). The larger transcript 
was termed p-UTR and the smaller sequence as y-UTR variant (Figures 4 A and 4B). 
Notably, the novel 144 nucleotide sequence found within the human p-UTR has significant 

20 homology with mouse exon la, but is 30 bases larger and contains an additional two ATG 
codons (Figure 4C). 

The p- and y-UTRs were also present in a brain sample and their identity was confirmed by 
sequence analysis (Figure 5). The expression of the p- and y-UTRs was further examined 
25 in HaCaT cells and seven BCC samples. The inventors found that the y-transcript was 
present in proliferating cultures of HaCaT cells and all BCC samples but in contrast to 
foreskin keratinocytes and brain tissue, the inventors were unable to amplify the p- 
transcript from these mRNAs (Figure 5). Therefore, the y-UTR transcript may represent 
the major variant expressed by proliferating cells in human tissues as well. 



30 
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EXAMPLE 7 
Genomic sequence of GUI 5' UTRs 

The partial mouse GLI1 UTR genomic sequence is shown in SEQ ID NOS:33 and 34 with 
5 a sequence between SEQ ID NO:33 and 34. Similarly, the partial human GLI1 UTR 
genomic sequence is shown in SEQ ID NOS:35 and 36, again with a sequence between 
SEQ ID NO:35 and 36. Complete genomic sequences for M-UTR and H-UTR are shown 
in SEQ ID NOS:59 and 60, respectively. 

10 EXAMPLE 8 

Functional analysis of the GLll 5* UTRs 

The 5'UTR is known to regulate gene expression by influencing the efficiency of 
translation. An examination of the GUI 5'UTR variants revealed three small upstream 

15 open reading frames (uORFs) in mouse a-UTR, two in human p-UTR, one uORF that 
overlaps the GUI ORF in mouse P-UTR and none in the y-UTRs (Table 2). The secondary 
structure for each 5'UTR was analyzed using the RNA folding prediction program 
MFOLD (44). This program predicted extensive secondary structures in the longer UTRs, 
with calculated free energy values of -72.7 to -94.5 kcal/mol for the mouse a-UTR, -65 

20 kcal/mol for the human p-UTR and -55 kcal/mol for the mouse p-UTR. The human and 
mouse y-UTRs were predicted not to form stable secondary structures and to have free 
energy values of -12 kcal/mol (Table 2). The presence of uORFs and stable secondary 
structures in the larger UTRs suggest that they are less efficiently translated than the y- 
variant that lacks these features. 

25 

To test the above prediction in vivo, the three mouse GUI 5'UTR fragments were cloned 
upstream of a GFP reporter gene (Figure 6A) and the constructs transiently transfected into 
HaCaT, Cos-1 and BHK-21 mammalian cell lines and primary mouse skin fibroblasts. The 
difference in the levels of GFP produced by these constructs was striking (Figure 6B),with 
30 the a-UTR construct producing the lowest fluorescence levels, the P-UTR construct 
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producing immediate levels and the y-variant producing the highest levels (even higher 
than the GFP vector control). Importantly, there was no apparent difference in the 
transfection efficiency of the constructs for a given cell type, showing that the increase in 
the number of brightly fluorescing cells transfected with the y-form is due to an increase in 
5 GFP production. 

To quantify these results, cells were also subjected to flow cytometry analysis (Figure 7A). 
The advantages of this technique over enzymic reporter assays are that GFP levels are 
determined in individual cells using a large number of cells (20,000 cells/construct) and 

10 untransfected cells are discarded from the calculations thus removing any bias due to 
transfection efficiencies. This analysis revealed that the a- and p-UTRs expressed the 
reporter 60-90% lower than the GFP vector control and whereas the y-UTR produced 2-3 
fold enhancement of expression (Figure 7B). Comparison of expression levels between the 
three variants reveals a 14-23 fold increase in GFP production by the mouse y-UTR 

1 5 construct over the a-UTR (y/a) and a 5-13 fold increase over the p-UTR (y/p) (Figure 7C). 
The greatest differences in GFP intensities were seen in HaCaT cells. These data show that 
the y-UTR facilitates expression of a heterologous protein whereas the a- and p-UTRs 
significantly suppress protein production. 

20 In order to determine whether this suppression is due to the presence of uORFs or some 
other property of the longer UTRs, such as increased secondary structure, two additional 
constructs were made. The uORFs of the a-UTR variant were removed by mutating all 
four upstream ATG codons to TTG. Significantly, the mutant a-UTR construct expressed 
the reporter at levels that approached that of the y-UTR construct (Figure 8). To ascertain 

25 whether the length of UTR sequence could influence expression levels, the inventors 
produced a y-UTR multimer containing four copies of this sequence in the same 
orientation. The y-UTR multimer, which contained 314 bp compared to the 307 bp of the 
a- variant, produced GFP levels that were only marginally lower than a single copy of the 
y-UTR (Figure 8). Taken together these data show that the uORFs of the Glil UTRs play a 

30 major role in the suppression of protein production. 




-64- 

In the following Examples, genetically modified plants were generated to show the effects 
of the GUI leader sequences on gene expression. 

EXAMPLE 9 

5 Plant material and general growth conditions 

Wild-type tobacco {Nicoticma tobaccum) line D38 was grown in the glasshouse to produce 
leaves for the transient assays with a GUS reporter gene carrying the various GLI 5' UTRs. 

10 Wild-type Arabidopsis thaliana (C24) was used in stable transformation experiments. 
Seeds were sprinkled over moist, sterilised potting mix and covered with a thin layer of 
finely ground vermiculite. To synchronize germination, seeds were stratified by placing 
pots at 4°C for 3-5 days. Plants were then transferred to a growth cabinet and grown at 
20°C under long days (16 hr photoperiod) to induce flowering. The soil was watered daily. 

1 5 Growth conditions for selection of transgenic Arabidopsis are given below. 

EXAMPLE 10 
General cloning techniques 

20 Below is a summary of the cloning techniques used to generate the constructs. 

Restriction enzyme digests 

All restriction enzymes used in the construction of the vectors were provided by New 
25 England Biolabs and digests were carried out in the appropriate buffers as specified by the 
manufacturer. In general, three to five times the recommended dosage of enzyme was 
used. Digestions were carried out at 37°C for a minimum of 3 hr to ensure complete 
digestion. 
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Agarose gel electrophoresis and DNA fragment purification 

Agarose gels were prepared using analytical grade agarose (Progen, Australia) dissolved in 
lx Tris borate EDTA (TBE). DNA samples were combined with gel loading buffer (62) 
5 and electrophoresed at 6V/cm. Following electrophoresis, gels were stained in ethidium 
bromide (0.5 ug/mL) and photographed on a UV transilluminator. DNA fragments were 
purified from agarose gels using Qiagen's QIAquick Gel Extraction Kit, following the 
manufacturer's instructions. 

1 0 Ligation of plasmid DNA 

All ligations except for those involving pGemT-easy were conducted using New England 
Biolabs T4 DNA ligase and the appropriate buffer in a volume of 10 or 20 uL at 16°C for 
16 hr. A molar ratio of 1:3 (vector to insert) was most commonly used for intermolecular 
15 ligations, with a reaction containing either 100 ng for smaller E. coli cloning vectors or 300 
ng for larger binary vectors. 

Transformation of DH5a competent cells 

20 50 uL ofDH5a competent cells were thawed on ice for approximately 10 min. The 
ligation reactions were then added to the cells, and incubated on ice for 30 min. The cells 
were then heat-shocked for 30 sec at 42°C and immediately placed back on ice for 5 min. 
An 800 uL aliquot of SOC media (62) was added and the cells were then incubated at 
37°C for 1 hr. Either 50 or 100 uL of the solution was then spread onto LB plates 

25 containing the appropriate antibiotic and the plates incubated overnight at 37°C. 

Screening colonies for ligation products 

For initial cloning into pGEMT-easy, the selection was based on blue white selection (62). 
30 For other cloning experiments, colonies were selected at random (due to the lack of blue 
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white selection) and sampled with a sterile pipette tip. The tip was then dipped into a 20 
uL diagnostic PCR containing 0.2 juM of each diagnostic oligonucleotide primer (which 
varied depending on the ligation), 200 uM dNTPs, in a buffer containing 10 mM Tris (pH 
8.3), 50 mM KC1, 1.5 mM MgCl 2 , 0.001% w/v gelatin, and 0.5 U of Taq DNA 
5 polymerase. The pipette tip was then used to inoculate a 5 mL LB culture (containing the 
appropriate antibiotic), which was then incubated at 37°C with shaking (100 rpm) for 16 
hi. Diagnostic primers were chosen to identify plasmids containing the appropriate insert. 
The PCR reactions were loaded into a PTC IM ~ 100 Thermal Controller and after a 5 min 
94°C hot start, were cycled as follows: 10 cycles at 94°C (15 sec), 60°C (15 sec), 72°C 
10 (45-180 sec depending on product size), then 25 cycles at 94°C (15 sec), 57°C (15 sec), 
72°C (45-180 sec). PCR products were then run on an agarose gel and visualized to 
identify recombinant plasmids. 

Plasmid purification 

15 

Cultures identified by PCR as containing the desired recombinant plasmids were 
subsequently minipreped using the BRESAspin™ (Geneworks) plasmid Mini-Kit 
following manufacturer's protocols. Diagnostic restriction enzyme digests were then 
performed to confirm the orientation or presence of the desired insert. If large quantities of 

20 the plasmid were required (e.g. for low copy number binary vectors), a larger volume (up 
to 400 mL) of LB containing the appropriate antibiotic was inoculated with a fresh colony 
containing the plasmid of interest and grown at 37°C with shaking (100 rpm) for 16 hr. 
The plasmid was then extracted using either the BRESApure ,M (Geneworks) plasmid 
Midi-Kit or Maxi-kit (depending on culture volume), according to the manufacturer's 

25 instructions. 
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EXAMPLE 11 
Amplification of GLI sequences for dimming constructs 

5 The template for the PCR was lng of pGEMT-easy (Promega, USA) containing the 
respective a, (3 or y sequences from murine GLI 1 . The 5' oligonucleotide primer used to 
amplify all sequences in the PCR was designated GLI5'-5«cI and its sequence was: 5'- 
TT GAGCTC AGTTCCAGCCCTGG -3' [SEQ ID NO:37]. A Sacl site (underlined) was 
incorporated at the 5' end to facilitate later cloning steps. The 3' oligonucleotide primer 
10 designed to amplify all sequences was designated GLI3'-AfcoI and its sequence was: 5'- 
AA CCATGG CGTCTCAGGGAA -3' [SEQ ID NO:38] and contained an Nco\ 
(underlined) site. 

The PCR was carried out on lng of template in a total volume of 20 uU in the presence of 

15 0.1 uM of the primers, 200 uM of dNTPs, 1.25 units of PFU DNA polymerase in the 
recommended buffer (Stratagene, USA). The samples were loaded into a PTC™- 100 
Thermal Controller (MJ Research, Inc., MA, USA) pre-heated to 85°C. The PCR started 
with 10 cycles at 94°C (15 sec), 58°C (15 sec), 72°C (30 sec) followed by 25 cycles at 
94°C (15 sec), 55°C (15 sec), 72°C (30 sec). The total PCR reactions were electrophoresed 

20 on a 2% w/v agarose gel and the products purified using the Qiagen QIAquick Gel 
extraction kit. The purified DNA fragments were cloned into pGEMT-easy (Promega, 
USA) following the manufacturer's recommendations. The plasmid containing the a clone 
was called UQC1421, the plasmid containing the p fragment was designated UQC1431, 
and the plasmid containing the y fragment was called pUQC1441. These fragments were 

25 subsequently authenticated by sequencing and comparison to the known sequences. 
Sequencing reactions were prepared using a PRISM Ready Reaction Bigdyedeoxy 
terminator kit (Applied Biosystems). The 20 uL reaction volumes contained 8 uL of 
reaction mix, 0.32 uM M13 F (5'-GGTTTCCCAGTCACCGAC-3' [SEQ ID NO:39]) or 
M13 R (5'-ACACAGGAAACAGCTATGACC-3' [SEQ ID NO:40]) and 500 ng of 

30 template plasmid. Twenty-five cycles of PCR were run at 94°C for 10 sec, 50°C for 5 sec 
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and 60°C for 4 min. Excess dye terminators were removed from the reaction mixture by a 
sodium acetate/ethanol precipitation, according to the manufacturer's instructions. The 
reactions were run on an Applied Biosystem 377 DNA sequencer. These a, p and y 
fragments were used to make the dimming constructs described below. 

5 

EXAMPLE 12 
Construction of dimming plasmids 

Two Not I sites were created to facilitate the cloning of the 35S:g7/:GUS:ocs transgene into 
10 the binary vector pUQC477. To achieve this, SLJ4D4 (Figure 10A) was digested with 
EcoRl, and ligated to a linker containing a Notl site. The resulting construct, designated 
pUQC1401, was then used in an additional ligation step to introduce a second Notl. This 
was done by digesting pUQC1401 with Hindlll and ligating to a linker containing another 
Notl site. The resulting plasmid was named pUQC141 1 (Figure 10B). 

15 

The following procedures were used to make the murine GL11 a, p and y dimming 
constructs. The dimming sequences were excised from pUQC1421 (a), pUQC1431 (p) and 
pUQCI441 (y) with Sacl and Nco\, and ligated into the corresponding sites between the 
35S promoter and GUS coding sequence of pUQC14U. The resulting plasmids were 
20 designated pUQC1451 (ct-GUS) [Figure 11 A), pUQC1461 (P-GUS) [Figure 1 IB) and 
pUQC1471 (y-GUS) [Figure 11C). These plasmids as well as the unmodified pUQC1411 
{H GUS) were used in the transient assays as well as further cloning steps. 

The binary vector used for Arabidopsis transformation was called UQC477 and was a 
25 modified version of the vector pNB96. pNB96 was digested with Hindlll to remove a 
superfluous 1 kb insert. This vector was ligated to an oligonucleotide containing a Notl site 
that restored the Hindlll site on one side of the ligation but not the other. The resulting 
construct (UQC477) contained a Notl site to facilitate cloning of the GUS transgene inserts 
and Hindlll site on one side to facilitate future cloning. Three binary constructs were 
30 created using UQC477 as a backbone. The plasmids pUQCHll (Q GUS transgene), 
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UQC1451 (a GUS transgene), pUQC146l (p GUS transgene), pUQC1471 (y GUS 
transgene) all contained Notl sites flanking the respective GUS transgene. The Notl sites 
were used to excise the transgenes and insert them into the Notl site of the binary vector 
UQC477. The inserts were orientated such that the 35S promoter of the GUS transgene 
5 was adjacent to, but in the opposite direction to the truncated 35S promoter controlling the 
BAR gene of the binary vector. The resulting binary vectors were called pUQC1481 (Q 
GUS transgene) [Figure 12A], pUQC1495 (a GUS transgene) [Figure 12B], pUQC1501 (P 
GUS transgene) [Figure 12C) and pUQC1511 (y GUS transgene) [Figure 12D). 

10 EXAMPLE 13 

Transient GUS assays in tobacco leaves 

Transient assays were performed using tungsten particle bombardment on tobacco leaves 
using a particle inflow gun. Tissue cultured tobacco leaves were cut into small pieces 

15 (approximately 0.5 cm 2 ) and placed on damp Wattman paper inside a sterile petri dish. The 
tungsten particle/plasmid DNA suspension was prepared by mixing, 25 ul of sterile 
tungsten particles (100 ng/ul), 5 ul plasmid DNA (1 ug/ul), 25 ul CaCl 2 (2.5 M) and 10 ul 
spermidine 0. 1 M) in a sterile Eppindorf. The mixture was incubated at 4°C for 5 minutes 
upon which 50 ul of the supernatant was removed and the remaining particles resuspended, 

20 5 uJ of the tungsten particle/DNA suspension was placed in the center of a screen in a 
syringe filter unit. The leaf pieces were placed 17 cm from the screen and protected by a 
baffle made of 500 mm nylon mesh. A vacuum of about 300 mm Hg was applied and the 
tungsten particles were discharged when helium at 80 PSI released. The leaf pieces were 
then incubated at 37°C over night and subsequently stained for GUS activity for 16 hr 

25 using a staining buffer consisting of 50 mM NaP0 4 (pH 7.0), 5 mM K ferricyanide, 5 mM 
K ferrocyanide, 0.3% Triton X-100 and 5 mg X-Gluc (per 10 ml) in a 2 ml Eppindorf. Leaf 
pieces were then cleared of chlorophyll using 70% v/v EtOH and viewed for GUS 
expression. 



30 
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EXAMPLE 14 
Transformation of Arabidopsis thaliana 

Arabidopsis thaliana was transformed using Agrobacterium tumefaciens via the floral dip 
5 method as described. The four constructs (pUQC1481, pUQC1495, pUQ1501 and 
pUQC151 1) described below were transformed using the following summary of the floral 
dip. 

Conjugation of binary constructs into Agrobacterium tumefaciens 

10 

Binary vectors were conjugated into A. tumefaciens by a tri-parental mating method. 
Agrobacterium was grown in a 10 mL LB culture containing 50 ug/mL rifampicin for 36 
hr at 28°C. Ten millilitre cultures containing 50 ug/mL of both a helper E. coli strain 
(pRK2013) and E. coli containing the binary vector of interest were grown for 12 hr at 

15 37°C Each culture was spun down at 3000 rpm for 10 min and the pellet resuspended in 1 
mL of LB. On an LB plate free of selection, 30 uL of each suspension was combined 
together and grown at 28°C for 16 hr. A streak from this plate was then grown on a plate 
containing 50 ug/mL rifampicin, 50 fig/mL kanamycin for 48 hr at 28°C. A single colony 
from this plate was then selected and grown on a plate containing the same selection, at 

20 28°C for 48 hr. The integrity of the transgene in the A. tumefaciens was confirmed with a 
diagnostic PCR test. 

Preparation of Agrobacterium for Arabidopsis transformation 

25 A single transformed colony was used to inoculate a 5 mL LB pre-culture containing 50 
^ig/mL rifampicin and 50 ug/mL kanamycin. After 48 hr at 28°C, the pre-culture was used 
to inoculate a 250 mL LB solution containing 50 ug/mL kanamycin. This culture was 
grown for 18-24 hr at 28°C and spun down at 5000 rpm for 10 min. 
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Transformation of Arabidopsis 

The transformed Agrobacterium pellet was resuspended in a 600 mL, 5% w/v sucrose 
solution and supplemented with 300 uL of the surfactant Silwet L-77 (0.05%). Plants 
5 which had their primary bolts clipped and had numerous secondary bolts with floral buds 
were used in the transformation. The above ground parts of the plant were submerged and 
mildly agitated for 3-5 seconds. The plants were then covered to maintain high humidity 
for 24 hr. The transformed plants were then grown normally and had seed catchers 
(modified plastic soft drink bottles) attached until the time of seed harvesting. 

10 

Selection of Tl transgenics 

Selection was carried out under phosphinothricin (PPT) selection (15 (xg/mL) on IX MS 
media supplemented with 1% w/v sucrose (Merck, Australia) and 1% w/v agar (Becton 
15 Dickenson, USA). 

In the following Examples, single and double nucleotide substitutions are made to GLI1 
leader sequences to introduce AUG or GUG pseudo-initiation sites. The Nv is also 
provided. 

20 

EXAMPLE 15 
Ma-UTR 

aguuuccagcccuggaccacgcaucccgagcaccgcgccccgacggaggucucuuugucc 
25 gcgccucucccacauacuagaaaucucucccuuucuugagguugggaugaagaagcaguu 
gggacggccagcuggaggucugcgugguagagggaacuccagagacuguggauccccaag 
acugaacggcugcuucugcccacucuuugggauguuucuucuuaaggaagcugaaaaacg 
uuauugauuuccaugaccaguuucugagaugaggguuagagguccccucauccuucccug 
agacgcc [SEQ ID NO:41]. 
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N v = 4.6 

EXAMPLE 16 
Ma-UTR+1 AUG 

5 

aguuucAUgcccuggaccacgcaucccgagcaccgcgccccgacggaggucucuuugucc 
gcgccucucccacauacuagaaaucucucccuuucuugagguugggaugaagaagcaguu 
gggacggccagcuggaggucugcgugguagagggaacuccagagacuguggauccccaag 
acugaacggcugcuucugcccacucuuugggauguuucuucuuaaggaagcugaaaaacg 
10 uuauugauuuccaugaccaguuucugagaugaggguuagagguccccucauccuucccug 
agacgcc [SEQ ID NO:42]. 

N v = 5.6 

EXAMPLE 17 
15 Ma~UTR + 2 AUG 

aguuucAUgcccAUgaccacgcaucccgagcaccgcgccccgacggaggucucuuugucc 
gcgccucucccacauacuagaaaucucucccuuucuugagguugggaugaagaagcaguu 
gggacggccagcuggaggucugcgugguagagggaacuccagagacuguggauccccaag 
20 acugaacggcugcuucugcccacucuuugggauguuucuucuuaaggaagcugaaaaacg 
uuauugauuuccaugaccaguuucugagaugaggguuagagguccccucauccuucccug 
agacgcc [SEQ ID NO:43]. 

N v = 6.6 

25 

EXAMPLE 18 
Ma-UTR + 3AUG 

aguuucAUgcccAUgaccaUgcaucccgagcaccgcgccccgacggaggucucuuugucc 
30 gcgccucucccacauacuagaaaucucucccuuucuugagguugggaugaagaagcaguu 
gggacggccagcuggaggucugcgugguagagggaacuccagagacuguggauccccaag 
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acugaacggcugcuucugcccacucuuugggauguuucuucuuaaggaagcugaaaaacg 
uuauugauuuccaugaccaguuucugagaugaggguuagagguccccucauccuucccug 
agacgcc [SEQ ID NO:44]. 

5 N v = 7.6 

EXAMPLE 19 
Ma-UTR + 4AUG 

10 aguuucAUgcccAUgaccaUgcaucccgagcaccgcgccccgacggaUgucucuuugucc 
gcgccucucccacauacuagaaaucucucccuuucuugagguugggaugaagaagcaguu 
gggacggccagcuggaggucugcgugguagagggaacuccagagacuguggauccccaag 
acugaacggcugcuucugcccacucuuugggauguuucuucuuaaggaagcugaaaaacg 
uuauugauuuccaugaccaguuucugagaugaggguuagagguccccucauccuucccug 

15 agacgcc [SEQ ID NO:45]. 

N v = 8.6 

EXAMPLE 20 
20 Ma-UTR + 5 AUG 

aguuucAUgcccAUgaccaUgcaucccgagcaccgcgccccgacggaUgucucuAugucc 
gcgccucucccacauacuagaaaucucucccuuucuugagguugggaugaagaagcaguu 
gggacggccagcuggaggucugcgugguagagggaacuccagagacuguggauccccaag 
25 acugaacggcugcuucugcccacucuuugggauguuucuucuuaaggaagcugaaaaacg 
uuauugauuuccaugaccaguuucugagaugaggguuagagguccccucauccuucccug 
agacgcc [SEQ ID NO:46]. 



N v = 9.6 

30 
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EXAMPLE 21 

aguuuccagccAuggaccacgcaucccgagcaccgcgccccgacggaggucucuuugucc 
5 gcgccucucccacauacuagaaaucucucccuuucuugagguugggaugaagaagcaguu 
gggacggccagcuggaggucugcgugguagagggaacuccagguccccucauccuucccu 
gagacgcc [SEQ ID NO:47]. 

N v = 2.3 

10 EXAMPLE 22 

A//?-t/77? + 2^<7G 

aguuuccagccAuggaccacgcaucccgagcaccgcgccccgaUggaggucucuuugucc 
gcgccucucccacauacuagaaaucucucccuuucuugagguugggaugaagaagcaguu 
15 gggacggccagcuggaggucugcgugguagagggaacuccagguccccucauccuucccu 
gagacgcc [SEQ ID NO:48]. 

Ny-3.3 

20 EXAMPLE 23 

MfiUTR+3AUG 

aguuuccagccAuggaccacgcaucccgagcaccgcgccccgaUggaggucucuuugucc 
gcgccucucccacauacuagaaaucucucccuuucuugagguugggaugaagaagcaguu 
25 gggacggccagcuggaggucugcgugguagagggaacuccagguccccucauccuuccAu 
gagacgcc [SEQ ID NO:49]. 

N v = 4.3 



30 
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EXAMPLE 24 
A/j> UTR + 1AUG 

aguuuccagccAuggaccacgcaucccgagcaccgcgccccgacggagguccccucaucc 
5 uucccugagacgcc [SEQ ID NO:50]. 

N v = 1 

EXAMPLE 25 

10 

aguuuccagccAuggaccaUgcaucccgagcaccgcgccccgacggagguccccucaucc 
uucccugagacgcc [SEQ ID NO:51]. 

N v -2 

15 

EXAMPLE 26 

aguuuccagccAuggaccaUgcaucccgagcaccgcgccccgaUggagguccccucaucc 
20 uucccugagacgcc [SEQ ID NO:52]. 

N v = 3 

EXAMPLE 27 
25 Hp UTR + 1 AUG 

agacuccagccAuggaccgcgcaucccgagcccagcgcccagacagagucuguguaucuc 
ugucucagggaaccgugggucuuugucuccgccucucccauauauuagaaauaucuuacu 
uucaugcgguuaaguugaagaggcuggagggauggcuagcuggaugucugcguuguagag 
30 agguaaccccaguguccccacacccuccucugagacgcc [SEQ ID NO:53]. 
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N v = 4.9 

EXAMPLE 28 
5 Hp UTR +2 AUG 

agacuccagccAuggaccgcgcauGccgagcccagcgcccagacagagucuguguaucuc 
ugucucagggaaccgugggucuuugucuccgccucucccauauauuagaaauaucuuacu 
uucaugcgguuaaguugaagaggcuggagggauggcuagcuggaugucugcguuguagag 
10 agguaaccccaguguccccacacccuccucugagacgcc [SEQ ID NO:54]. 

N v = 5.9 

EXAMPLE 29 
15 Hp UTR + 3 A UG 

agacuccagccAuggaccgcgcauGccgagcccagcgcccagacagagucuguguauGuc 
ugucucagggaaccgugggucuuugucuccgccucucccauauauuagaaauaucuuacu 
uucaugcgguuaaguugaagaggcuggagggauggcuagcuggaugucugcguuguagag 
20 agguaaccccaguguccccacacccuccucugagacgcc [SEQ ID NO:55]. 

N v = 6.9 

EXAMPLE 30 
25 Hy UTR + 1 AUG 

agacuccagccAuggaccgcgcaucccgagcccagcgcccagacagaguguccccacacc 
cuccucugagacgcc [SEQ ID NO:56]. 



30 Nv=1.3 
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EXAMPLE 31 
HyVTR + 2 AUG 

agacuccagccAuggaccgcgcauGccgagcccagcgcccagacagaguguccccacacc 
cuccucugagacgcc [SEQ ID NO:57]. 

N v = 2.3 

EXAMPLE 32 
H/UTR + 3AUG 

agacuccagccAuggaccgcgcauGccgagcccagcgcccagacagaguguccccacacc 
cuccucugagaUgcc [SEQ ID NO:58]. 

N v = 3.3 

Those skilled in the art will appreciate that the invention described herein is susceptible to 
variations and modifications other than those specifically described. It is to be understood 
that the invention includes all such variations and modifications. The invention also 
includes all of the steps, features, compositions and compounds referred to or indicated in 
this specification, individually or collectively, and any and all combinations of any two or 
more of said steps or features. 
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TABLE 1 Primer sequences 1 



Name 


Location" 


Nucleotide Sequence 


mGliFl 


exon 1, mouse 


agtttccagccctggaccacg [SEQ ID NO:12] 


mGHR2 


exon 2, mouse 


ggcgtctcagggaaggatgag [SEQ ID NO: 13] 


hGHFl 


exon 1, human 


agactccagccc tggaccgcg [SEQ ID NO:14] 


hGHR2 


exon 2, human 


ggcgtctcagaggagggtgtg [SEQ ID NO: 15] 


RACE1 


exon 4, mouse 


gaggtgggaatcctaag [SEQ ID NO: 16] 


RACE2 


exon 2/3, mouse 


ccagaaagtcct tctgt tcccatgctgg [SEQ ID NO: 17] 


mGHFla 


exon la, mouse 


ctctccctttcttgaggttgg [SEQ ID NO: 18] 



5 

a Exons are numbered according to Figures IB and 4B 
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TABLE 2 Primer sequences 2 



Names 


ab 

Nucleotide Sequence 


mGliFl Nhe 


gctagcagtttccagccctggaccacg [SEQ ID NO: 29] 


mGliR2 Age 


accggtggcgtctcagggaaggatgag [SEQ ID NO: 30] 


mGliFl Bam 


ggatccagtttccagccctggaccacg [SEQ ID NO: 27] 


mGliR2 Bgl 


agatctggcgtctcagggaaggatgag [SEQ ID NO: 28] 


mGliMFl 


tcttgaggttgggttgaagaagcagtt [SEQ ID NO: 19] 


mGHMRl 


aactgcttcttcaacccaacctcaaga [SEQ ID NO:23] 


mGhMF2 


cccactctttgggttgtttcttcttaa [SEQ ID NO: 20] 


mGHMR2 


ttaagaagaaacaacccaaagagtggg [SEQ ID NO: 24] 


mGliMF3 


gttattgatttccttgaccagtttctg [SEQ ID NO:21] 


mGHMR3 


cagaaactggtcaaggaaatcaataac [SEQ ID NO: 25] 


mGHMF4 


accagtttctgagttgagggttagagg [SEQ ID NO: 22] 


mGliMR4 


cc tctaaccctcaactcagaaactggt [SEQ ID NO: 26] 



Primers mGHF 1 Nhe , mGliR2 Age and mGliF 1 Bam , mGliR2 Bgl are identical to mGHF 1 
5 and mGliR2 (Table 1) but include restriction sites for Nhel, Agel, BamWl and 

BgUl respectively (shown in bold). 

b The point mutations (A->T) introduced by primers mGliMFl -4 SEQ ID NOS: 19- 
22] and mGHMRl -4 [SEQ ID NOS:23-26] are shown underlined. 
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